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Chapter  1 

Introduction 


Abstract  We  define  the  framework  of  classical  physics  as  considered  in  the  present 
book.  We  briefly  point  out  its  place  in  the  history  of  physics  and  its  relation  to 
modern  physics.  As  the  prime  example  of  a  theory  of  classical  physics  we  introduce 
Newtonian  mechanics  and  discuss  its  limitations.  This  leads  to  and  motivates  the 
study  of  different  formulations  of  classical  mechanics,  such  as  Lagrangian  and 
Hamiltonian  mechanics,  which  are  the  subjects  of  later  chapters.  Finally,  we  explain 
why  in  this  book,  we  take  a  mathematical  perspective  on  central  topics  of  classical 
physics. 


Classical  physics  refers  to  the  collection  of  physical  theories  that  do  not  use  quantum 
theory  and  often  predate  modern  quantum  physics.  They  can  be  traced  back  to  Newton 
(17th  century)  and  in  some  sense  even  further  all  the  way  to  Aristotle,  Archimedes, 
and  other  Greek  philosophers  of  antiquity  (starting  in  the  4th  century  BC).  However, 
this  does  by  far  not  mean  that  theories  of  classical  physics  are  exclusively  a  subject 
of  the  past.  They  continue  to  play  important  roles  in  modern  physics,  for  example  in 
the  study  of  macroscopic  systems,  such  as  fluids  and  planetary  motions,  where  the 
effects  of  the  quantum  behavior  of  the  microscopic  constituents  are  irrelevant. 

Newtonian  mechanics  is  arguably  the  first  mathematically  rigorous  and  self- 
contained  theory  of  classical  physics.  In  its  traditional  formulation,  Newton’s  theory 
comprises  three  physical  laws  known  as  Newton’s  laws  of  motion,  describing  the 
relationship  between  a  body  (usually  assumed  to  be  a  point  particle  of  constant  mass 
m  >  0)  and  forces  acting  upon  it.  They  also  quantify  the  resulting  motion  of  the  body 
in  response  to  those  forces  and  can  be  summarised,  in  an  inertial  reference  frame,  as 
follows. 

First  law:  A  body  at  rest  will  stay  at  rest  and  a  body  in  uniform  motion  will  stay  in  motion 
at  constant  velocity,  unless  acted  upon  by  a  net  force. 
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Second  law:  The  force  F  acting  on  a  body  is  equal  to  the  mass  of  the  body  times  the  acceleration 
a  of  the  body: 

F  =  ma  .  (1.1) 

Third  law:  For  every  action  force  there  is  a  corresponding  reaction  force  which  is  equal  in 
magnitude  and  opposite  in  direction.  This  is  often  abbreviated  as  actio  =  reactio. 

Considering  a  point  particle  moving  in  IR3,  its  acceleration  at  time  t  is  given  by  the 
second  time  derivate  of  its  position  vector,  denoted  by  a(t)  =  x.  Throughout  the 
entire  book,  we  work  in  the  C°°-category,  unless  stated  otherwise,  that  is,  manifolds 
and  maps  between  them  are  usually  assumed  to  be  smooth.  The  force  F  is  generally 
allowed  to  depend  on  position  x,  velocity  x,  and  time  t,  that  is  F  =  F(x,x,  t).  Eq.  (1.1) 
then  turns  into 

x=—F(x,x,t).  (1.2) 

m 

For  m  and  F(x,x,t)  given,  this  is  a  set  of  second-order  ordinary  differential  equations 
known  as  Newton ’s  equations  of  motion.  Note  that  F  =  0  if  and  only  if  the  motion 
t  — >  x(t)  is  linear  and  therefore  Newton’s  first  law  is  a  special  case  of  the  second 
law.  Also,  the  third  law  is  a  consequence  of  the  second  law  in  combination  with 
conservation  of  momentum,  which  ultimately  follows  from  translational  invariance 
(see  Corollary  2.15).  Despite  this  redundancy  in  Newton’s  laws,  all  three  highlight 
different  aspects  of  important  concepts  of  modern  physics,  namely  the  notion  of 
inertial  frame  in  Einstein’s  theory  of  relativity  and  the  relation  between  different 
concepts  of  mass  (inertial  versus  gravitational)  in  gravitational  theories  (see,  for 
example,  [5]). 

We  remark  that  eq.  (1.1)  is  also  sometimes  written  as  F  =  p,  where  p  =  mx 
is  the  momentum  of  the  particle.  This  allows  for  the  consideration  of  bodies  with 
non-constant  mass,  such  as  rockets  consuming  their  fuel,  where  however  the  second 
law  needs  to  be  applied  to  the  total  system  including  the  lost  (or  gained)  mass. 

In  order  to  solve  eq.  (1.2)  for  a  given  mechanical  system,  one  first  needs  to 
determine  an  actual  expression  for  the  net  force  F(x,x,t).  However,  this  can  be 
intricate  in  practice,  especially  when  constraint  forces  are  present.  Typically  for 
constraint  forces  it  is  easy  to  describe  the  constraints  geometrically,  while  it  is  difficult 
to  explicitly  obtain  the  corresponding  function  F(x,x,t).  For  example,  consider  a 
ball  moving  on  the  surface  of  a  flat  table.  Geometrically,  the  constraint  imposed  by 
the  presence  of  the  table  implies  that  the  vertical  component  of  x  (usually  denoted 
by  x3)  is  held  fixed.  On  the  other  hand,  incorporating  this  constraint  into  eq.  (1.2) 
amounts  to  specifying  an  actual  functional  expression  for  the  force  exerted  by  the 
table  on  the  ball.  Another  problem  with  Newton’s  formulation,  partly  related  to  the 
previous  issue,  is  the  intrinsic  use  of  Cartesian  coordinates.  Indeed,  changing  to  a 
different  coordinate  system  is  generally  cumbersome. 

Given  these  limitations,  one  is  led  to  consider  more  geometric  formulations,  such 
as  Lagrangian  and  Hamiltonian  mechanics,  in  which  generalized  coordinates  are 
introduced.  These  two  formulations  were  introduced  by  Lagrange  in  1788  and  by 
Hamilton  in  1833,  respectively.  They  can  be  adapted  to  the  mechanical  system  at 
hand,  for  example  in  order  to  incorporate  geometrically  the  presence  of  constraint 
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forces.  In  addition,  symmetries  can  be  identified  more  straightforwardly  and  exploited 
more  efficiently  in  these  alternative  formulations  of  classical  mechanics.  Historically, 
in  particular  Hamilton- Jacobi  theory  -  yet  another  formulation  -  has  also  played  an 
important  role  in  the  development  of  quantum  mechanics  [3,  7,  18].  It  has  also  paved 
the  way  for  the  development  of  classical  field  theory,  which  incorporates  Einstein’s 
theory  of  relativity  and  nowadays  underlies  many  areas  of  modern  physics. 

For  these  reasons,  a  profound  understanding  of  these  central  areas  of  classical 
physics  is  very  important  and  hence,  many  textbooks  on  these  subjects  exist,  such 
as  for  example  refs.  [1,  2,  7,  8].  However,  many  of  these  textbooks  require  a  strong 
background  in  physics  and  do  not  put  strong  emphasis  on  mathematical  rigor.  This 
presents  difficulties  for  the  more  mathematically  inclined  reader. 

As  a  complementary  contribution  to  the  existing  literature,  the  present  book  takes 
a  different,  namely  more  mathematical,  perspective  at  these  central  topics  of  classical 
physics.  It  puts  emphasis  on  a  mathematically  precise  formulation  of  the  topics 
while  conveying  the  underlying  geometrical  ideas.  For  this  purpose  the  mathematical 
presentation  style  (definition,  theorem,  proof)  is  used  and  all  theorems  are  proved.  In 
addition,  the  field  theory  part  is  formulated  in  terms  of  the  theory  of  jet  bundles,  high¬ 
lighting  the  relativistic  covariance.  Jet  bundles  do  not  receive  widespread  attention 
in  the  physics  literature  up  to  now.  Each  chapter  of  the  book  is  accompanied  by  a 
number  of  exercises,  which  can  be  found  collectively  in  Appendix  A.  The  interested 
reader  is  highly  encouraged  to  try  the  exercises,  as  this  will  greatly  help  in  gaining  a 
deeper  understanding  of  the  subject. 

This  book  grew  out  of  a  lecture  course  on  “mathematical  methods  of  classical 
physics”  held  in  the  winter  semester  of  2015/2016  as  part  of  the  master’s  program 
in  mathematics  and  mathematical  physics  at  the  Department  of  Mathematics  at  the 
University  of  Hamburg.  It  is  therefore  primarily  directed  at  readers  with  a  background 
in  mathematical  physics  and  mathematics.  Also,  physicists  with  a  strong  interest  in 
mathematics  may  find  this  text  useful  as  a  complementary  resource. 


Chapter  2 

Lagrangian  Mechanics 


Abstract  In  this  chapter,  we  lay  out  the  foundations  of  Lagrangian  Mechanics.  We 
introduce  the  basic  concepts  of  Lagrangian  mechanical  systems,  namely  the  Lag¬ 
rangian,  the  action,  and  the  equations  of  motion,  also  known  as  the  Euler-Lagrange 
equations.  We  also  discuss  important  examples,  such  as  the  free  particle,  the  har¬ 
monic  oscillator,  as  well  as  motions  in  central  force  potentials,  such  as  Newton’s 
theory  of  gravity  and  Coulomb’s  electrostatic  theory.  Highlighting  the  importance  of 
symmetries,  we  study  integrals  of  motion  and  Noether’s  theorem.  As  an  application, 
we  consider  motions  in  radial  potentials  and,  further  specializing  to  motions  in  New¬ 
ton’s  gravitational  potential,  we  conclude  this  section  with  a  derivation  of  Kepler’s 
laws  of  planetary  motion. 


2.1  Lagrangian  mechanical  systems  and  their  equations  of 
motion 

Definition  2.1.  A  Lagrangian  mechanical  system  is  a  pair  (M,  Jz?)  consisting  of  a 
smooth  manifold  M  and  a  smooth  function  on  TM.  The  manifold  M  is  called  the 
configuration  space  and  the  function  is  called  the  Lagrangian  function  (or  simply 
the  Lagrangian)  of  the  system. 

More  generally,  one  can  allow  for  the  Lagrangian  to  depend  explicitly  on  an  extra 
variable  interpreted  as  time. 

Example  2.2.  The  Lagrangian  of  a  point  particle  of  mass  m  >  0  moving  in  Euclidean 
space  R”  under  the  influence  of  a  potential  V  G  C°°(R”)  is 

ff(v)=^m(v,v)—V(x),  v  G  Tx R” ,  x  G  R" , 

where  (•,•)  denotes  the  Euclidean  scalar  product.  Given  a  smooth  cur\’e  t  H >  y{t) 
in  R",  the  quantities  E^n(t)  =  \mlf/ (t),Y (/)}  and  Epot(t )  =  V (y(f ))  are  called  the 
kinetic  energy  and  the  potential  energy  at  time  t,  respectively.  Their  sum 
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E(t)  =  £kin(0  +  £pot(0 

is  the  total  energy  at  time  t. 

By  specializing  V  one  obtains  many  important  mechanical  systems: 

1.  The  free  particle:  V  =  0. 

2.  The  harmonic  oscillator:  n  =  1  and  V  =  \kx2,  where  k  is  a  positive  constant 
known  as  Hooke’s  constant. 

3.  Newton’s  theory  of  gravity:  n  =  3  and 


KmM 


where  r  =  \x\  denotes  the  Euclidean  norm  of  the  vector  x,  M  >  0  is  the  mass  of  the 
particle  (placed  at  the  origin  0  G  IR3)  generating  the  gravitational  potential,  and 
K  is  a  positive  constant  known  as  Newton’s  constant.  (Notice  that  in  this  case  V  is 
not  defined  at  the  origin,  so  the  configuration  space  is  IR3  \  {0}  rather  than  IR3.) 
4.  Coulomb’s  electrostatic  theory:  n  =  3  and 

y  _  keq\qi 
r  ' 

where  q\  is  the  charge  of  the  particle  of  mass  m  moving  under  the  influence  of  the 
electric  poten  tial  generated  by  a  particle  of  charge  qz  and  ke  is  a  positive  constan  t 
known  as  Coulomb’s  constant  Notice  that  up  to  a  constant  factor  Coulomb’s 
potential  is  of  the  same  type  as  Newton 's  potential.  However,  contrary  to  Newton ’s 
potential,  the  Coulomb  potential  can  have  either  sign.  As  we  shall  see,  this  corres¬ 
ponds  to  the  fact  that  electric  forces  can  be  attractive  or  repulsive,  depending  on 
the  sign  of  the  product  q\qz,  whereas  gravitational  forces  are  always  attractive. 
Another  important  difference  is  that  the  gravitational  potential  contains  the  mass 
m  as  a  factor  whereas  the  electric  potential  does  not.  As  we  shall  see,  the  former 
property  implies  that  the  acceleration  y"  of  the  particle  in  Newton ’s  theory  of 
gravity  is  independent  of  its  mass. 

Example  2.3.  More  generally,  given  a  smooth  function  V  on  a  pseudo-Riemannian 
manifold  ( M,g ),  one  can  consider  the  Lagrangian 

% (v)  =  *  g(v,  v)  -V(x),  v  G  TXM ,  x  G  M. 

Definition  2.4.  Let  (M,flfl)  be  a  Lagrangian  mechanical  system.  The  action  of  a 
smooth  curve  y :  [a,  b]  — >  M  is  defined  as 

S(y)  :=  fb se (i (t))dt. 

J  a 

A  motion  of  the  system  is  a  critical  point  of  S  under  smooth  variations  with  fixed 
endpoints.  (This  statement  is  a  mathematical  formulation  of  Hamilton’s  principle  of 
least  action,  which  should  better  be  called  principle  of  stationary  action.) 
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Next  we  will  derive  the  equations  of  motion ,  which  are  the  differential  equa¬ 
tions  describing  critical  points  of  the  action  functional.  These  are  known  as  the 
Euler-Lagrange  equations  and  are  given  in  (2.3).  For  this  we  introduce  coordin¬ 
ates  (at,  . . .  ,xn)  on  an  open  subset  U  C  M.  They  induce  a  system  of  coordinates 
( q,q )  =  (q1,...,qn,q1,...,qn)  on  the  open  subset  TU  =  n~l(U)  C  TM,  where 
n  :  TM  — >  M  denotes  the  canonical  projection.  The  induced  coordinates  are  defined 

by 

q‘:=x'on,  q‘(v)  :=  dxl(v),  vGTU. 

Let  js  ■  [« ■  b]  — >  M,  —e  <$<£,£>  0,  be  a  smooth  variation  of  y :  \a.  b]  — >  A7  with 
fixed  endpoints.  Then  in  local  coordinates  as  above  we  compute 


where  /  :=  x‘  o  y  and  y1  :=  (/')'.  Note  that  here  and  throughout  the  rest  of  the  book, 
we  use  an  adapted  version  of  Einstein ’s  summation  convention,  where  repeated  upper 
and  lower  indices  are  to  be  summed  over.  This  is  indicated  by  a  plain  sum  symbol 
(see  also  page  xi).  If  we  denote  by  f  :=  ^  |  j  0  y,  the  variation  vector  field  along  y, 
then  we  can  rewrite  this  as 


d 

ds 


*tfs)  = 

s= 0 


(Yyr  +  ^-iyir’) 


where  Yl  :=  dx'(fV)  =  q\y)  are  the  components  of  "V  and 

Notice  that  the  local  vector  field  iLver  =  "E1  -jA  along  /  is  independent  of  the 

choice  of  local  coordinates  and,  hence,  defines  a  global  vector  field  along  /.  This 
follows  from  the  fact  that  it  corresponds  to  "V  under  the  canonical  identification  of 
Ty/^M  with  the  vertical  tangent  space 

Tftt)(TM)  :=kcrdn\y{t)  C  Ty{t)(TM). 

Using  this  vector  field  we  can  rewrite  the  function  /  as 


f  =  |y. 

This  shows  that  also  /  is  globally  defined  on  the  interval  [a,b\  and  vanishes  at  the 
endpoints.  Therefore  we  have 
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d 

ds 


S(Ys) 

1=0 


fb  rb  U 


i=0 


1=0 


(2.1) 

(2.2) 


where  the  integrand  is  a  globally  defined  function  on  the  interval  \a,b\  with  the 
following  local  expression: 

*m)  -  /«) = E  (§f  (r'M)  ■ -  ~  O'  wi)  no. 


d 

ds 


Theorem  2.5.  Let  (M,2Sf)  /;<?  a  Lagrangian  mechanical  system,  n  =  dimM.  The 
motions  of  the  system  are  the  solutions  y:  [a,/?]  — »  M  of  the  following  system  of 
ordinary  differential  equations: 


a‘:=w(y)~*w(y>  =  0’ 


(2.3) 


Proof  Let  us  first  remark  that  for  every  vector  field  "V  along  y  the  function 
Y  ctjf/l  =  |  Q  ff(ff)  —  f  is  coordinate  independent  and  thus  globally  well-defined. 

From  this  we  can  deduce  that  a  =  Y  fXylx'  \  y  is  a  well-defined  one-form  along  y  (cf. 
Appendix  A,  Exercise  1).  In  virtue  of  (2.2),  we  see  that  y  is  a  motion  if  and  only  if 


a(r)\,dt  =  0. 


for  all  vector  fields  "V  along  y  vanishing  at  the  endpoints.  To  see  that  this  implies  that 
cc(to)  =  0  for  all  to  £  (a,b)  we  take  8  >  0  such  that  y([fo  —  8, to  +  5])  is  contained 
in  a  coordinate  domain  and  consider  =  ha,,  where  h  >  0  is  a  smooth  function 
on  \a,b\  with  support  contained  in  (to  —  8, to  +  5)  such  that  h(to)  >  0.  This  defines 
a  smooth  vector  field  f  along  y  vanishing  at  the  endpoints  and  0  =  ff  n(  V)  \ ,dt  = 
.Q-S  hYa?dt  now  implies  that  a  (to)  =  0.  This  proves  that  a  \(a^)  =  0  and,  by 
continuity,  a  =  0.  □ 


Definition  2.6.  The  one-form  a  along  y  will  be  called  the  Euler-Lagrange  one-form. 

Example  2.7.  Let  (M,g)  be  a  pseudo-Riemannian  metric  and  consider  the  Lag¬ 
rangian  f£(y)  =  jg(v,  v),  v  £  TM.  In  canonical  local  coordinates  (ql , . . .  ,q" , 
ql , . . . ,  qn)  on  TM  associated  with  local  coordinates  (x1 , . . .  ,xJ1)  on  M  it  is  given  by 


-Sf= 


where  g  =  Ygijdx’dx 1  and  gij  =  gjj  o  n.  We  compute 
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dq' 
dj? 
dq 1 


d8kj 

dx‘ 


kzj 


on  q  q 


i  =  L.mj> 


d££ . 


w(Y)  =  ^ 


1  v-  d8kj 


dxl 


(y)rV  > 


dq‘ 


t(Y)  =  L^(r)f , 


d  d%(Y)  =  Lxf  (r)^77'+L^;(7)f , 


c/f  <9g' 


dxk 

-*  =  Iwlrif  +  jl  (|^W+^W-  §?«) 
=  5><(r)  (V  ■ 


This  shows  that  the  Euler-Lagrange  equations  are  equivalent  to  the  geodesic  equa¬ 
tions 

f+'Lrfk(r)tif=° ,  £=i,...,«. 

Using  the  covariant  derivative  V  f/tey  can  be  written  as  Vy"/  =  0  or  =  0. 

Proposition  2.8.  Let  V  be  a  smooth  function  on  a  pseudo-Riemannian  manifold 
{M,g)  and  consider  the  Lagrangian 

-S?(v)  =  ^(v,v)-y(n;(v)),  vGTM,  (2.4) 

of  Example  2.3.  Then  a  curve  y  :  I  ^  M  is  a  motion  if  and  only  if  it  satisfies  the 
following  equation 

J/  +  gradV|r  =  0  (2.5) 

Proof.  We  have  already  shown  that  the  Euler-Lagrange  one-form  in  the  case  V  =  0 
is  given  by  a  =  —  g  ( •  It  remains  to  compute  the  contribution  of  the  potential 
V  to  the  Euler-Lagrange  one-form,  which  is 


jo**-- "V 


So  in  total  we  obtain 


and,  hence. 


-a=s(|/,-)+^7 
-g_Ia  =  ^y  +  gradV|r, 
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where  g  1  :  T*M  —tTM  denotes  the  inverse  of  the  map 
g:TM^T*M,  vh>j(v,-). 


□ 


2.2  Integrals  of  motion 

Definition  2.9.  Let  (M,  Jzf)  be  a  Lagrangian  mechanical  system.  A  smooth  function 
f  on  TM  is  called  an  integral  of  motion  if  it  is  constant  along  every  motion  of  the 
system,  that  is  for  every  motion  y.  I  —y  M  the  function  /(/)  is  constant. 

Obviously,  the  integrals  of  motion  are  completely  determined  by  the  equations 
of  motion  and  do  not  depend  on  the  precise  Lagrangian.  Therefore,  the  notion  of 
an  integral  of  motion  is  meaningful  if  we  are  just  given  a  system  of  second  order 
differential  equations1  for  a  curve  y :  /  — >  M  i  n  a  smooth  manifold  M. 

Proposition  2.10  (Conservation  of  energy).  Let  V  be  a  smooth  function  on  a  pseudo- 
Riemannian  manifold  ( M,g )  and  consider  the  Lagrangian 

■Z’(v)  =  ^g(v,v)-V(n{v)),  v  e  TM,  (2.6) 

of  Example  2.3.  Then  the  total  energy 

vh>E(v)  :=  ^g(v,v)+V{n(v)) 

is  conserved,  that  is  it  is  an  integral  of  motion.  IfV  is  constant  then  the  motion  is 
geodesic. 

Proof.  We  compute  the  derivative  of  r  i — L(f  (t))  along  a  motion  y: 

I  (yb'.iO+nr))  +dv\ry  (2=?o. 

If  V  is  constant  then  the  equation  of  motion  (2.5)  reduces  to  the  geodesic  equation. 

□ 

Remark.  The  last  statement  of  the  proposition  is  related  to  conservation  of  momentum 
in  Newtonian  mechanics.  Recall  from  Chapter  1  that  the  momentum  of  a  particle 
y :  I  — >  IR"  of  mass  m  moving  in  Euclidean  space  is  p  =  mf .  Newton’s  second  law  of 
mechanics  has  the  form  p'  =  F,  where  F  is  the  force  acting  on  the  particle,  which  may 
depend  on  the  position  y  and  momentum  p  of  the  particle.  Obviously,  F  =  0  implies 

1  Such  a  system  will  be  usually  given  by  a  consistent  specification  of  a  system  of  second  order 
differential  equations  for  the  components  of  the  curve  in  each  local  coordinate  system.  A  typical 
example  is  (2.3). 
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that  the  momentum  is  constant  along  every  motion.  The  conservation  of  energy  holds 
in  Newtonian  mechanics  if  the  force  field  F  depends  only  on  the  position  and  is  a 
gradient  vector  field.  In  fact,  then  we  can  write  F  =  —  grad  V  for  some  function  V  on 
R"  and  hence,  Newton’s  equation  is  a  special  case  of  (2.5). 

Definition  2.11.  Let  (M,  .'X )  be  a  Lagrangicin  mechanical  system.  A  dijfeomorphism 
tp  :  M  — >  M  is  called  an  automorphism  of  the  system  if 


Jf(dtpv)=J?(v), 

for  all  v  G  TM.  A  vector  field  X  £  X(M)  is  called  an  infinitesimal  automorphism  if 
its  flow  consists  of  local  automorphisms  of 

Theorem  2.12  (Noether’s  theorem).  With  every  infinitesimal  automorphism  X  of 
a  Lagrangian  mechanical  system  (M,2z?)  we  can  associate  an  integral  of  motion 
f  =  £/2zf2fver,  where  XveT  £  X(TM)  denotes  the  vertical  lift  of  X.  (In  local  coordinates 


■  have  Xver  =  E(X‘  o  n)  ^  ifX  =  £X' £.) 


Proof.  Let  us  denote  by  (ps  the  flow  of  X.  For  every  motion  y :  I  — >•  M  of  the  system 
with  values  in  the  domain  of  definition  of  (ps  the  curve  ys  =  <pv  c  y  is  again  a  motion 
and 

Ys=d(psY,  —  Ys=X\y, 


ds 


1=0 


where 


TM3vv+X(  v)= 

ds 


d<ps(v ) 


^=0 


denotes  the  vector  field  on  TM  the  flow  of  which  is  d(ps.  Differentiating  the  equation 
no  d(ps  =  (pso7l  with  respect  to  s,  we  obtain  dnoX  =X  on.  In  local  coordinates  (x! ) 
on  M  and  corresponding  local  coordinates  (q\q‘)  on  TM  this  means  that 


The  locally  defined  functions  Y‘  on  TM  can  be  computed  at  v  G  TM  as  follows: 


Y\v)  =  dqlX{v)  =  — 


1=0 


q'(d(psv)  =  — 


i=0 


V  I 


d(Ps  d 
dxl  dxk 


d_ 

ds 


1=0  j  dxJ 


where  tp‘.  :=  x‘  o  tps  and  we  have  used  that  4- 


i=0 


tpls  =X'.  The  above  implies  that 


0=1- 


&(Y)=dJ?X\y  =  Yll-j-r 


x\Y)+d-^i 

y  dq' 


dX\f)  .  (2.7) 
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Using  the  Euler-Lagrange  equations  we  can  now  compute 


d  ^  dL£ 

dt  dq' 


x‘(y) 

y 


dx\i)  [2=  o 
y 


□ 

Proposition  2.13.  The  group  of  automorphisms  of  the  Lagrangian  system  of  Ex¬ 
ample  2.3  is  given  by  the  Lie  subgroup 

Aut =  {(p  G  Isom(M,g)|V  o  (p  =  V}  C  Isom(M,g). 

Its  Lie  algebra  consists  of  all  Killing  vector  fields  X  such  that  X(V)  =  0. 

Proof  See  Appendix  A,  Exercise  6.  □ 

Corollary  2.14.  Let  V  be  a  smooth  function  on  a  pseudo-Riemannian  manifold 
(M,g)  and  consider  the  Lagrangian  fif(v)  =  ^g(v,  v)  —  V  (7r(v)),  v  GTM,  of  Example 
2.3.  Then  every  Killing  vector  field  X  such  that  X(V)  =  0  gives  rise  to  an  integral  of 
motion  f(v)  =  g(v,X(7r(v))),  v  G  TM. 

Corollary  2.15.  ( Conservation  of  momentum )  Consider,  as  in  the  previous  corollary, 
the  Lagrangian  of  Example  2.3.  Assume  that  with  respect  to  some  coordinate  system 
on  M  the  metric  g  and  the  potential  V  are  both  invariant  under  translations  in  one  of 
the  coordinates  x'.  Then  the  function 

Pi  ■=  Y.SiAJ 

is  an  integral  of  motion  on  the  coordinate  domain. 

Since  the  Euclidian  metric  is  translational  invariant  we  have  the  following  special 
case  of  Corollary  2.15. 

Corollary  2.16.  Let  V  =  V  (x2, . . .  ,xJ1 )  be  a  smooth  function  on  Euclidean  space 
R"  which  does  not  depend  on  the  first  coordinate  and  consider  the  Lagrangian 
2z?(v)  =  ^m(v,v)  —  V(n(y)),  vG  TIR",  of  Example  2.2.  Then  the  first  component  p\ 
of  the  momentum  vector  p  =  Y,Piei an  integral  of  motion,  where  Pi{v)  :=  m(v,ei), 
v  G  riR". 

Note  that  the  result  still  holds  if  we  replace  the  Euclidean  scalar  product  by  a 
pseudo-Euclidean  scalar  product,  such  as  the  Minkowski  scalar  product. 

Corollary  2.17  (Conservation  of  angular  momentum).  Let  V  be  a  smooth  radial 
function  on  IR3  \  {0},  that  is  V  depends  only  on  the  radial  coordinate  r,  and  consider 
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the  Lagrangian  2z?(v)  =  jm(v,  v)  —  V ( 7T (v) ),  v  €  T(l R3  \  {0}).  Then  the  components 
of  the  angular  momentum  vector 


(xl\  /  v1  \ 

L(v)  =  n{v)  x  p(v)  =  m  x2  x  v2 

V*3/  VV 

are  integrals  of  motion.  Here,  x1,*2,*3  denote  the  components  of  x  =  71  (v). 

Proof  This  can  be  proven  either  directly  from  the  equations  of  motion  or  by  applying 
Noether’s  theorem  (see  Appendix  A,  Exercise  9).  The  first  proof  uses  the  fact  that 
the  moment  of  force  y  x  F(y),  F  =  —  grad  V ,  is  zero  for  every  curve  y :  /  — »  R3  \  {0} 
and  yields  also  the  following  proposition.  □ 

Definition  2.18.  A  vector  field  F  on  IR3  \  {0}  is  called  radial  if  there  exists  a  radial 
function  f  on  R3  \  {0}  such  that  F(x)  =  f(x)  ^  for  all  x  £  IR3  \  {0}. 

Proposition  2.19.  Let  F  be  a  smooth  radial  vector  field  on  IR3  \  {0}.  Then  the 
angular  momentum  is  constant  for  every  solution  of  Newton ’s  equation  jpp(t)  =  F(t), 
where  we  are  using  the  usual  notation  p(f)  =  p(y,(t))  and  F(t )  =  F(y(t)). 

Remark.  A  function  or  vector  field  on  IR3  \  {0}  is  radial  if  and  only  if  it  is  spherically 
symmetric,  that  is  invariant  under  SO(3)  (or,  equivalently,  0(3)),  see  Appendix  A, 
Exercises  1 1  and  12.  The  name  is  due  to  the  fact  that  SO(3)  is  the  group  of  orientation 
preserving  isometries  of  the  sphere  S 2  and  0(3)  =  Isom  (.S’2). 

In  the  next  section  we  will  see  how  to  use  the  conservation  of  energy  and  angular 
momentum  to  analyze  the  motion  in  a  radial  potential. 


2.3  Motion  in  a  radial  potential 

We  consider  the  Lagrangian  ££{y)  =  \ m(v,v )  —  V(r)  on  IR3  \  {0},  where  V  is  a 
smooth  function  of  the  radial  coordinate  r  =  |x|  alone.  Since  the  mass  can  be  absorbed 
into  the  definitions  of  V  and  2§?  (denoting  .1J / m  and  V  fin  again  by  f£  and  V )  we 
may  as  well  put  m  =  1.  So  from  now  on  we  consider  a  particle  of  unit  mass. 

We  know  by  the  results  of  the  previous  section  that  the  energy  E{t)  =  j  (Y(t),  /(f)) 
+V(|y(f)|)  and  angular  momentum  L(f)  =  y(f)  x  /(t)  are  constant  for  every  motion 
y :  /  — >-  (R3  \  {0}  of  the  system. 

The  first  observation  is  that  the  conservation  of  angular  momentum  implies  that 
the  motion  is  planar,  that  is  contained  in  a  plane. 

Proposition  2.20.  If  the  vector  L  =  y  x  Y  is  constant  along  the  curve  y  :  1  — > 
[R3  \  {0}  then  y  is  a  planar  curve. 

Proof.  If  L  =  0  then  y  is  a  radial  curve  (see  Appendix  A,  Exercise  7)  and  thus  planar. 
Therefore  we  can  assume  that  L  is  a  nonzero  constant  vector.  Since  the  cross  product 
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of  two  vectors  is  always  perpendicular  to  both  of  them,  we  know  that  y(f)  G  L  for 
all  t.  So  y  is  contained  in  the  plane  Lx.  □ 

By  a  change  of  coordinates  we  can  assume  that  the  motion  is  restricted  to  the  plane 
e-j  and  use  polar  coordinates  (r,  (p )  in  that  plane  rather  than  Cartesian  coordinates 
(x1 ,x2)  to  describe  the  motion.  In  these  coordinates,  the  Euclidean  metric  is  dr2  + 
r2d(p 2  and,  hence, 

£=  *  (f2  +  r2<p2)+V(r).  (2.8) 

To  compute  the  cross  product  y  x  /  in  polar  coordinates  we  recall  that 

dr  =  cos  (pdi  +  sin<p<92,  <9<p  =  —  rsitupdi  +  r  cos  (pd 2, 

where  we  have  abbreviated  dr  =  d/dr,  d(fi  =  d/d(p  and  d,  =  d/dx'.  From  these 
relations  we  easily  obtain 

dr  x  dv  =  re  3 

and  hence 

L  =  y  x  /  =  rd,  x  ( rdr  +  (pdy)  =  rcpe^. 

Changing  the  time  parameter  t  to  —t,  if  necessary,  we  can  assume  that  the  constant 
1-  (p  >  0  and,  hence, 

L  :=  |L|  =  r2(p.  (2.9) 

Substituting  (p  =  L/r2  into  the  energy  equation  (2.8)  we  arrive  at 

1  Lr 

E=-r2  +  VeS(r ),  Vef£(r):=V(r)  +  ^,  (2.10) 

where  Veff  is  called  the  effective  potential.  As  a  consequence,  we  obtain  the  following 
result. 

Theorem  2.21.  Consider  a  particle  of  unit  mass  moving  in  a  radial  potential  V  (r) 
in  [R3.  Then  the  radial  coordinate  obeys  the  equation  of  motion  r  =  —  V_g(r)  of  a 
particle  1 1— »  r(t)  >  0  moving  in  the  Euclidean  line  under  the  potential 

yeff(r):=V(r)  +  ^,  (2.11) 

which  reduces  to  the  first  order  equation 

f  =  ±v/2(£— Veff(r))  (2.12) 

solvable  by  separation  of  variables.  The  angular  coordinate  is  then  given  by 

(P(f)  =  L  [  -n2+<P(°)> 

Jo  r(s)z 

where  the  initial  value  (p(  0)  is  freely  specifiable  or,  as  a  function  of  r,  by 
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< P(r )  =  ±L 


ds 

;2^/2(E-Veif(sf)+(P> (r°  ' 


Proof.  The  energy  equation  (2.10)  immediately  implies  (2.12),  which  can  be  solved 
by  separation  of  variables: 


Integration  yields  t  as  a  function  of  r.  Then  r  as  a  function  of  t  is  the  inverse  of  that 
function.  Differentiating  (2.12)  gives  Newton’s  equation  r  =  Given  r  as  a 

function  of  t,  the  angular  coordinate  is  now  determined  from  (p  =  L/r 2  by  integration 
as  claimed.  Finally,  to  express  <p  as  a  function  of  r  one  does  not  need  to  know  the 
inverse  of  the  function  r  H >•  t(r)  but  can  proceed  as  follows: 


L 


rV2(£-Veff(r))’ 


□ 


Next  we  specialize  the  discussion  to  Newton’s  potential  V(r)  =  —y,  from  which 
we  will  derive,  in  particular,  Kepler’s  laws  of  planetary  motion. 


2.3.1  Motion  in  Newton’s  gravitational  potential 

By  (2. 1 1)  we  know  that  the  motion  in  Newton’s  gravitational  potential  is  governed 
by  the  following  effective  potential 


(2.13) 


We  will  assume  that  L  >  0;  the  case  L  =  0  is  treated  in  Appendix  A,  Exercise  8.  The 
qualitative  behavior  of  the  effective  potential  is  essential  for  the  qualitative  analysis 
of  the  possible  orbits.  The  proof  of  the  next  proposition  is  elementary. 

Proposition  2.22.  The  function  f  :  IR>0  — >  IR  defined  in  (2.13)  has  the  following 
properties: 

1.  f  has  a  unique  zero,  at  ro  :=  jfij. 

2.  It  is  positive  for  r  <  ro  and  negative  for  r  >  ro . 

3.  lim,.^o/(r)  =  °°>  =  0. 

4.  f  has  a  unique  critical  point,  at  rm;n  :=  ^  >  rp,  where  f  attains  its  global 
minimum  f(rmin)  =~^j- 

5.  f  is  strictly  decreasing  for  r  <  r^  n  and  strictly  increasing  for  r  >  rm  ;n. 

Proposition  2.23. 
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(i)  The  range  of  possible  energies  for  motions  with  nonzero  angular  momentum  in 

Newton ’s  gravitational  potential  is  given  by  the  interval  —  ■  °°)  •  The  motions 

are  unbounded  if  E  >  0  and  bounded  if  E  <  0.  The  orbit  of  minimal  energy 
Emm  =  —  ||j  is  a  circle  of  radius  lM . 

( ii)  The  perihelion  distance  rper,  that  is  the  minimal  distance  to  the  origin,  for  a  given 
energy  E  >  0  (and  L  >  0)  is  given  by  the  unique  positive  solution  of  the  equation 

r2  +  § r  -  yg  =  0,  which  is  rper  =  (~  1  +  •  F°r  £min  <  E  <  0 


fcrnfif  L  >  0)  the  distance  to  the  origin  varies  between  the  two  positive  solutions 
of  the  equation  r2  -\-^r  —  kg  =  0,  which  are  rper  =  Mr  (  1  —  • 


and 


raph  ~  7\E\  ^  + 
to  the  origin ). 


1+2 


l2e 

~mt 


( the  aphelion  distance,  that  is  the  maximal  distance 


Proof  The  first  part  follows  immediately  from  Proposition  2.22.  For  the  second  part 
note  that  at  the  points  of  minimal  or  maximal  distance  to  the  origin  we  have  r  =  0 
and,  hence,  E  =  V4it  =  /•  The  last  equation  is  equivalent  to  the  quadratic  equation 
y2  +  Tr  ~  TE  =  0’  which  has  a  unique  solution  for  E  =  Em ;n  (the  radius  of  the  circular 
orbit),  two  positive  solutions  for  Em ;n  <  E  <  0,  and  only  one  positive  solution  for 
E>  0.  □ 


Next  we  will  solve  the  equations  of  motion.  The  function  1 1— »•  r(t)  is  determined 
by 


To  solve  this  equation  we  make  the  substitution  u  =  1  jr  and  compute 


du 

du 

dt 

r 

r 

dtp 

dt 

t{<P)  d(P  ~ 

r2tp 

t(v)  L 

which  implies 

d2u 

r  dt 

r 

rr2 

(N 

1 

M 

dtp2 

L  dtp 

Ltp 

L 2 

L2 

L2 

So  we  obtain  the  equation 

dfu  M 

~dy2+U  =  l?1 


the  general  solution  of  which  is 


u  =  kcos(<p  —  <po)  + 


M 

Z?’ 


where  k  >  0  and  (po  are  constants.  By  choosing  (po  =  0,  this  allows  us  to  write 
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P 

1  +ecos  tp  ’ 


P 


L2  _  kL2 
M  ’ 


(2.14) 


For  0  <  e  <  1  this  is  an  ellipse  of  eccentricity  e. 

For  e  =  1  it  is  a  parabola  and  for  e  >  1  a  component  of  a  hyperbola.  (Observe 
that  for  the  parabola  and  hyperbola  the  angle  (p  is  constrained  by  the  condition 
that  e  cos  <p  >  —  1 .)  Recall  that  for  an  ellipse  with  major  half-axis  a  and  minor  half¬ 
axis  b  the  distance  between  the  two  focal  points  is  2c,  where  c2  =  a2  —  b2,  and  the 
eccentricity  and  parameter  are  given  by 

c 

£=-, 

a 

see  e.g.  [10,  VIII. 43]  for  a  detailed  discussion  of  conic  sections. 

Summarizing  we  obtain: 

Theorem  2.24.  The  motions  of  a  particle  (with  nonzero  angular  momentum )  in 
Newton ’s  gravitational  potential  are  along  conic  sections  with  a  focus  at  the  origin. 
The  bounded  motions  are  ellipses. 

The  latter  statement  is  the  content  of  Kepler’s  first  law  (of  planetary  motion):  The 
planets  move  along  ellipses  with  the  sun  at  a  focal  point.  Kepler’s  second  law  asserts 
that  the  area  swept  out  during  a  time  interval  [t,t  +s]  by  the  line  segment  connecting 
the  sun  and  the  planet  depends  only  on  its  duration  s  and  not  on  the  initial  time  t. 
This  is  a  simple  consequence  of  the  conservation  of  angular  momentum  as  shown  in 
Appendix  A,  Exercise  13: 

Proposition  2.25.  Let  y :  /  — >•  [R3  \  {0}  be  the  motion  of  a  particle  in  a  radial  force 
field  F  according  to  Newton ’s  law  my"  =  F(y).  Then  the  angular  momentum  vector 
L  is  constant,  the  motion  is  planar  and  the  area  A(fo,fi)  swept  out  by  the  vector  y 
during  a  time  interval  [to, ft]  is  given  by 


Mtofii) 


L 

2 


(ft  —  to), 


where  L  is  the  length  of  the  angular  momentum  vector. 

As  a  corollary  we  obtain  Kepler’s  third  law : 

Corollary  2.26.  The  elliptic  orbits  in  Newton ’s  gravitational  potential  are  periodic 
with  period 


In 

7ml 


,3/2 


where  a  is  the  major  half-axis  of  the  ellipse. 


Proof.  For  a  given  initial  time  to  £  R  let  us  denote  by  T  =  T (to)  the  smallest  positive 
real  number  such  that  (r,<p)\to+r  =  (r,  <p),0,  where  t  H >  ( r(f),<p(f ))  is  an  elliptic 
motion  of  the  system.  Since  A  (to,  to  +  T)  =  b^T  is  the  area  of  the  ellipse,  T  is  clearly 
independent  of  to,  which  shows  that  the  motion  is  periodic.  The  area  of  the  ellipse  is 
A  =  nab  and,  hence. 
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2nab  (2.14)  2 nab 
~L~  ~  ^pM 


2: icv'l2 

Vm 


Chapter  3 

Hamiltonian  Mechanics 


Abstract  We  present  Hamilton’s  formulation  of  classical  mechanics.  In  this  formula¬ 
tion,  the  n  second-order  equations  of  motion  of  an  ^-dimensional  mechanical  system 
are  replaced  by  an  equivalent  set  of  2 n  first-order  equations,  known  as  Hamilton’s 
equations.  There  are  problems  where  it  is  favorable  to  work  with  the  2 n  first-order 
equations  instead  of  the  corresponding  n  second-order  equations.  After  introdu¬ 
cing  basic  concepts  from  symplectic  geometry,  we  consider  the  phase  space  of  a 
mechanical  system  as  a  symplectic  manifold.  We  then  discuss  the  relation  between 
Lagrangian  and  Hamiltonian  systems.  We  show  that,  with  appropriate  assumptions, 
the  Euler-Lagrange  equations  of  a  Lagrangian  mechanical  system  are  equivalent 
to  Hamilton’s  equations  for  a  Hamiltonian,  which  can  be  obtained  from  the  Lag¬ 
rangian  by  a  Legendre  transformation.  In  the  last  part,  we  consider  the  linearization 
of  mechanical  systems  as  a  way  of  obtaining  approximate  solutions  in  cases  where 
the  full  non-linear  equations  of  motion  are  too  complicated  to  solve  exactly.  This 
is  an  important  tool  for  analyzing  physically  realistic  theories  as  these  are  often 
inherently  non-linear. 


3.1  Symplectic  geometry  and  Hamiltonian  systems 

Definition  3.1.  A  symplectic  manifold  (M,  ft))  is  a  smooth  manifold  M  endowed 
with  a  symplectic  form  ft),  that  is  a  non-degenerate  closed  2-form  ft). 

Example  3.2  (Symplectic  vector  space).  Let  at  be  a  non-degenerate  skew-symmetric 
bilinear  form  on  a  finite  dimensional  real  vector  space  V .  Then  (V,  ft))  is  called  a 
(real)  symplectic  vector  space.  Every  symplectic  vector  space  is  of  even  dimension 
and  there  exists  a  linear  isomorphism  V  — >  IR2",  2 n  =  dim  V,  which  maps  ft)  to  the 
canonical  symplectic  form 


n 

©can  =  J2  ^X‘  A  dxn+' , 
i=  1 


(3.1) 
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where  (x1 , . . .  ,x2")  are  the  standard  coordinates  on  IR2". 

It  is  a  basic  result  in  symplectic  geometry,  known  as  Darboux’s  theorem,  see  e.g. 
[1,  Thm.  3.2.2],  that  for  every  point  in  a  symplectic  manifold  (M,  ft))  there  exists  a 
local  coordinate  system  (x1 , . . .  ,x2n)  defined  in  a  neighborhood  U  of  that  point,  such 
that 

n 

to  |  u  =  ^  dx1  A  dx"+l . 

i=  1 

So  ft) |  jj  looks  like  the  canonical  symplectic  form  on  R2”. 

Example  3.3  (Cotangent  bundle).  Let  n:N  =  T*M  —y  M  be  the  cotangent  bundle 
of  a  manifold  M.  We  define  a  l -form  A  on  N  by 

A^(v)  :=Z(dn(y)), 

for  all  i ^  €  N,  v  €  T^N.  The  1-form  is  called  the  Liouville  form.  Its  differential 
CO  =  dX  is  a  symplectic  form  on  N,  which  is  called  the  canonical  symplectic  form  of 
the  cotangent  bundle  N.  To  check  that  CO  is  indeed  non-degenerate,  let  us  compute 
the  Liouville  form  in  coordinates  (q1,...,qn,pi,...,p„)  on  K~l(U)  =  T*U  C  T*M 
associated  with  coordinates  (x  , . . .  ,x")  on  some  open  set  U  C  M: 

ql=x'on,  pj(%)  =  %  €  T*JJ . 


Since  under  the  projection  dn  :  TN  — >•  TM  the  vector  fields  d /dq'  and  d /dpi  are 
mapped  to  d /dx1  and  zero,  respectively,  at  every  point  E,  =  Y,^jdxJ \k(^)  £  T*U  we 
have 

This  shows  that 

A|  u=Y^Pidcl,^  (o\u  =YJdpi/\dq' , 

proving  that  CO  is  a  symplectic  form. 

Given  a  smooth  function  /  on  a  symplectic  manifold  (M,  ft)),  there  is  a  unique 
vector  field  Xf  such  that 

df  =  -Co(Xf,-). 

Definition  3.4.  The  vector  field  Xf  is  called  the  Hamiltonian  vector  field  associated 
with  f . 

According  to  Darboux’s  theorem,  we  can  locally  write  ft)  =  Y.dpi  A  dq'  for  some 
local  coordinate  system  (q1, ...  ,q",p\,  ...,pn)  on  M.  In  such  coordinates,  which  will 
be  called  Darboux  coordinates,  we  can  easily  compute 


d£d_ 

dpi  dq ' 


\ 

dq'  dpi)  1 


(3.2) 


see  Appendix  A,  Exercise  19. 
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Definition  3.5.  A  Hamiltonian  system  (M,  0).  II)  is  a  symplectic  manifold  ( M ,  co) 
endowed  with  a  function  H  called  the  Hamiltonian.  An  integral  curve 

y.l-yM  of  the  Hamiltonian  vector  field  Xu,  defined  on  an  open  interval  I  C  IR,  is 
called  a  motion  of  the  Hamiltonian  system.  The  corresponding  system  of  ordinary 
differential  equations 

y=XH(y)  (3.3) 

is  called  Hamilton’s  equation.  An  integral  of  motion  of  the  Hamiltonian  system  is  a 
function  f  €  C°°(M)  which  is  constant  along  every  motion.  The  symplectic  manifold 
(M,  co)  is  called  the  phase  space  of  the  Hamiltonian  system. 

Proposition  3.6.  Let  (M, Ot.ll)  be  a  Hamiltonian  system.  Then  H  is  an  integral  of 
motion. 

Proof.  Let  y :  /  — >■  M  be  a  motion  of  the  system.  Then 

jH(Y(t))  =  dHf  (t)  =  -co(XH(y(t)),  /(r))  =  -ta{XH,XH)  |y(f)  =  0. 

□ 

Proposition  3.7.  In  Darboux  coordinates,  Hamilton ’s  equation  for  a  curve  y :  I  — t 
M  takes  the  form 

r)  H  r) 

<f(0  =  ^(7(0)’  (3-4) 

for  all  i=  1  (here,  n  =  where  ql(t)  =  ql(y(t)),  pft)  =  Pi(y(t)). 

Proof.  This  follows  immediately  from  (3.2)  by  comparing  /  =  £  (V^f  +  A  j 
withX//(y).  □ 

We  refer  henceforth  to  eq.  (3.3)  as  Hamilton’s  equation  and  to  eq.  (3.4)  as 
Hamilton’s  equations. 


3.2  Relation  between  Lagrangian  and  Hamiltonian  systems 

In  this  section,  we  clarify  the  relation  between  Lagrangian  and  Hamiltonian  systems. 
We  will  show,  under  certain  assumptions,  that  the  Euler-Lagrange  equations  of  a 
Lagrangian  mechanical  system  (M,Jtf)  are  equivalent  to  Hamilton’s  equation  for 
a  certain  Hamiltonian  H  on  T*M  endowed  with  the  canonical  symplectic  form  co. 
If  this  is  true,  then  the  Lagrangian  mechanical  system  (M,  ££)  should  possess  an 
integral  of  motion,  corresponding  to  the  integral  of  motion  H  of  the  Hamiltonian 
system  ( T*M,co,H ). 
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3.2.1  Hamiltonian  formulation  for  the  Lagrangian  systems  of 
Example  2.3 

In  the  case  of  Lagrangians  of  the  form  2z^(v)  =  jg(v,v)  —  V(7r(v)),  v  £  TM,  con¬ 
sidered  in  Example  2.3,  we  showed  in  Proposition  2.10  that  the  energy  E(v)  = 
jg(v,v)  +  V(7r(v)),  v  £  TM,  is  indeed  an  integral  of  motion.  Moreover,  we  have  a 
natural  identification  of  TM  with  T*M  by  means  of  the  pseudo-Riemannian  metric: 

(j)  =  <fg  :  TM  -»  T*M,  vhj(v,-). 

Proposition  3.8.  Let  ( M,g )  be  a  pseudo-Riemannian  manifold  and  denote  by 
(/)  :  TM  —t  T*M  the  isomorphism  of  vector  bundles  induced  by  g.  Let  V  be  a  smooth 
function  on  M  and  consider  the  Lagrangian  ££  of  Example  2.3. 

(i)  Then  a  smooth  curve  y :  I  — >■  M  is  a  solution  of  the  Euler-Lagrange  equations  if 
and  only  if  the  cur\’e  (f>  o  f  :  I  — y  T*M  is  a  solution  of  Hamilton’s  equation  for  the 
Hamiltonian  II  =  E  o  (j)  1 . 

(ii)  Conversely,  if  a  smooth  curve  y :  I  — >  T*M  is  a  motion  of  the  Hamiltonian  system 
( T*M ,  ft),//)  then  the  curve  n  o  y :  /  — >■  M  is  a  motion  of  the  Lagrangian  system 

where  71  :  T*M  — >  M  denotes  the  projection.  The  maps  y  i— >-  tj)  o  f  and 
y  71  oy  are  inverse  to  each  other  when  restricted  to  solutions  of  the  Euler- 
Lagrange  equations  and  Hamilton’s  equations,  respectively. 

Proof.  We  prove  (i).  Part  (ii)  is  similar  and  part  of  Appendix  A,  Exercise  22.  Let 
(x  , . . .  ,xn)  be  coordinates  defined  on  some  open  set  U  CM.  They  induce  coordinates 
(q1, . . .  ,q",  q1,. . .  ,qn)  on  TU  C  TM  and  (ql , . . .  ,q",  pi, . . .  ,pn)  on  T*U  C  T*M.  In 
terms  of  these  coordinates  (j)  is  given  by 

dJ? 

q‘°(t>=q\  Pi°<l>  =  Y,(8ij07:)qJ  = 

where  K  :  'T M  — >  M  denotes  the  projection.  The  inverse  yt :  T*M  — >  TM  is  given  by 

q'°w  =  q'i  ql°v  =  Y,(8,] °n)Pj, 


where  (g'f  is  the  matrix  inverse  to  {gif)  and  K  :  T*M  — >  M  is  the  projection.  Therefore 
we  obtain 


H  =  Eoyr=  o^  +  Poji,  n  :  T*M  —y  M, 

and  by  Proposition  3.7,  Hamilton’s  equation  takes  the  form 

qi  =  JL{8iimPj=q\ 

1  ^  dekt  dV 

P‘  =  ~2  L-gjMptPt-JjM 
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dxl 


where  7 : 1  -y  M,  ql(t)  =  x'(7(/)),  <?'(/)  =  (/(/))  and  p,(f)  =  pi((j){Y (/)))•  So  we 

see  that  0  o  /  :  /  — »  T"M  is  a  motion  of  the  Hamiltonian  system  if  and  only  if 
f  :  I  — >  TM  satisfies 


q'=q',  pi=—(Y), 


where  Pi(t)  =  ^r{Y (/))■  Substituting  the  expression  for  pL  into  the  second  equation 
one  obtains  precisely  the  Euler-Lagrange  equations,  whereas  the  first  equation  holds 
for  every  curve  7.  □ 


Next  we  will  generalize  the  above  constructions  to  any  Lagrangian  satisfying 
an  appropriate  non-degeneracy  assumption  analogous  to  the  non-degeneracy  of  the 
pseudo-Riemannian  metric. 


3.2.2  The  Legendre  transform 


As  a  first  step  we  will  show  that  for  every  Lagrangian  mechanical  system  in  the  sense 
of  Definition  2.1  (that  is  for  which  the  Lagrangian  does  not  explicitly  depend  on 
time)  we  can  define  an  integral  of  motion  which  generalizes  the  energy  debited  for 
Example  2.3.  For  this  we  remark  that  for  the  Lagrangian  of  Example  2.3  the  energy 
can  be  written  in  the  form 


d ^ 


E  =  V 

^  dq<  1 


We  use  this  formula  to  dehne  the  energy  E  for  any  Lagrangian  ,1J .  To  show  that  the 
debnition  is  coordinate  independent  it  is  sufficient  to  remark  that  the  vector  held 


is  coordinate  independent.  It  is  in  fact  the  vector  held  generated  by  the  one-parameter 
group  of  dilatations 

( p,:TM^-TM ,  v  e‘v. 

Its  value  at  v  £  TM  is  simply 


£(v)=4  <Mv)=v, 

?=o 

where  on  the  right-hand  side  v  £  TXM,  x  =  n{\’),  is  interpreted  as  vertical  vector 
by  means  of  the  canonical  identibcation  TYelTM  =  TXM.  The  following  result  is  a 
generalization  of  Proposition  2.10. 
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Proposition  3.9  (Conservation  of  energy).  Let  ( M,2 z?)  be  a  Lagrangian  mechanical 
system.  Then  the  energy  E  =  E,  („£?)  —  is  an  integral  of  motion. 

Proof.  It  suffices  to  differentiate  1 K >  E(f(t))  along  a  motion  y :  /  — >■  M.  We  obtain 


which  vanishes  by  the  Euler-Lagrange  equations.  □ 

Next  we  generalize  the  isomorphism  0  =  (j)g  :  TM  — >  T*M  defined  by  a  pseudo- 
Riemannian  metric  g.  As  we  have  shown,  it  can  be  written  in  terms  of  the  Lagrangian 
as 

,  ,  ,  <3^ 

qo(f)=q,  Pio(j)  =  —. 

We  claim  that  these  formulas  define  a  smooth  map  tj)  =  (jtj?  :  T M  — >  T*M  for  any 
Lagrangian  f£ .  To  see  this  let  us  define  (j)^  in  a  coordinate  independent  way.  For 
v  €  TXM,  x  €  M .  we  define 

<Mv)  =  d  (T£\Txm)  |v  G  (TxM)*  =  T*M.  (3.5) 

What  we  obtain  is  a  smooth  map  that  maps  any  vector  v  €  TM  to  a  covector  )  £ 
T*M  at  the  point  x  =  7t(i’).  In  other  words,  0^-  is  a  one-form  along  the  projection 
map  n  :  TM  — >  M.  or,  equivalently,  a  section  of  the  vector  bundle  n*  T*M  over  T M. 
Notice  that  as  a  map  from  TM  to  T*M  it  is  fiber-preserving  but  (contrary  to  <j>g)  in 
general  not  linear  on  fibers.  Also,  in  general,  it  is  not  even  a  local  diffeomorphism. 
In  local  coordinates  we  have 


<MV) 


^  dq' 


Definition  3.10.  Let  (M,  L/’ )  be  a  Lagrangian  mechanical  system,  n  =  dim M.  Then 
Jzf  is  called  non-degenerate  if  is  of  maximal  rank,  that  is  if  d(j)jy  has  rank  2 n 
everywhere.  The  Lagrangian  is  called  nice  if  it  is  non-degenerate  and  (j)^>  is  a 
bijection. 

Proposition  3.11.  Let  (M,2z?)  be  a  Lagrangian  mechanical  system.  Then  the 
following  conditions  are  equivalent: 

1.  2z?  is  non-degenerate. 

2.  :  TM  — >•  T*M  is  a  local  diffeomorphism. 

3.  For  all  x  G  M,  txm  '■  TXM  — >  TfM  is  of  maximal  rank. 

4.  For  all  v  £  TM,  there  exists  a  coordinate  system  (xl 2 3 4)  defined  on  an  open 

neighborhood  U  oft r(v)  such  that  the  matrix  is  invertible,  where 

(ql , . . .  ,q" ,ql , . . .  1qn)  are  the  corresponding  coordinates  on  TU. 
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5.  For  all  v  GTM  and  every  coordinate  system  (xl )  de  fined  on  an  open  neighborhood 
U  ofn{v),  the  matrix  is  invertible. 

Proof.  See  Appendix  A,  Exercise  20.  □ 


Proposition  3.12.  Let  (M,2z?)  be  a  Lagrangian  mechanical  system.  Then  the 
following  conditions  are  equivalent: 

1.  2z?  is  nice. 

2.  ->  T*M  is  a  dijfeomorpliism. 

3.  2z?  is  non-degenerate  and  for  all  x  G  M,  <j>jg\rxM  '■  TXM  — >  TfM  is  a  bijection. 

4.  For  all  x  G  M,  <f>^\rxM  •  TXM  —y  T*M  is  a  dijfeomorpliism. 


Proof.  See  Appendix  A,  Exercise  21. 


□ 


We  can  now  generalize  Proposition  3.8  to  the  case  of  non-degenerate  Lagrangians. 
For  simplicity  we  assume  that  ££  is  nice  such  that  §<£  :  TM  —y  T*M  is  not  only  a 
local  but  a  global  diffeomorphism.  The  general  (local)  result  for  non-degenerate 
Lagrangians  is  left  as  an  exercise  (see  Appendix  A,  Exercise  23). 

Theorem  3.13.  Let  be  a  Lagrangian  mechanical  system  with  nice  Lag¬ 
rangian.  Then  0  =  :  TM  — >  T*M  is  a  diffeomorphism  and  the  following  hold: 

(i)  A  smooth  curve  y  :  I  —y  M  is  a  solution  of  the  Euler-Lagrange  equations  if  and 
only  if  the  curve  (j)  o  /  :  I  — ►  T*M  is  a  solution  of  Hamilton’s  equation  for  the 
Hamiltonian  H  =  F  o  (j)  ] ,  where  E  is  the  energy,  defined  in  Proposition  3.9. 

(ii)  Conversely,  if  a  curve  y  :  I  — >  T*M  is  a  motion  of  the  Hamiltonian  system 

(T*M ,(0,H),  then  the  curve  71  o  y  :  I  —y  M  is  a  motion  of  the  Lagrangian  sys¬ 
tem  where  TZ :  T*M  —y  M  denotes  the  projection. 

( iii)  The  maps  y^tfof  and  y  >  n  o  y  are  inverse  to  each  other  when  restricted  to 
solutions  of  the  Euler-Lagrange  equations  and  Hamilton’s  equations,  respectively. 

Proof.  Let  us  first  remark  that  the  function  H  on  T*M  gives  rise  to  a  smooth  map 
if/  =  i ffn  :  T*M  sTM.  For  a  GTfM,x  G  M,  we  define 


Wh(oc)  =  d  (H\t*m)  |a  G  (TfM)*  =  TXM.  (3.6) 

In  local  coordinates  ( ql ,pi )  on  T*U  C  T  M  and  ( q‘,q ')  on  TU  C  TM  associated 
with  local  coordinates  (x1)  on  U  CM,  it  is  given  by 

,  i  ~i  dH 

q  °Y  =  q  ,  q  oy/=—. 

a  Pi 

We  claim  that  (j)  and  l //  are  inverse  to  each  other.  In  particular,  y/  is  a  diffeomorphism. 
Since  we  already  know  that  (j)  is  bijective,  we  only  need  to  check  that  y/o(j)  =  Id/  v/- 
Obviously  ql  oy/otj)  =  q'.  Thus  it  is  sufficient  to  check  that  q'  oy/otj)  =  q‘.  Let  us 
denote  by  (a‘J  )  the  n  x  n- matrix  inverse  to  the  matrix  (a;/)  with  matrix  coefficients 

an  :=  Then  we  have 

dq'dqJ 
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=dEod((f>  l)  ^r— =  dE  o  (d(j>)  1 
dpi  opi  opi 

=  °r'  = 


dqi  J 


and 


dE 


d J? 


dqi  dqj'^JAk  &)  L^' 


dqk 


:  d2S£ 

dqidqk 


—  ^^q  a jk- 


Substituting  the  second  equation  into  the  first  equation,  we  obtain 

dH  ~i  A-l 
—  =q'o<l) 
dpi 


and,  hence. 


,  dEl 

q  o  if/otp  =  - —  o  0  =  q  . 
dpi 


This  proves  that  =  0  1  and,  in  particular,  that  p,  c  0  c  =  p,,  that  is 

d^f 


dq' 


’  ¥  =  Pi- 


(3.7) 


(3.8) 


To  relate  Hamilton’s  equation  to  the  Euler-Lagrange  equations,  we  compute  with  the 
help  of  (3.7),(3.8): 


dH  d  d 

dq1  =  dq‘  (E  °  ^  =  dq1 


:d.£ 


dH 


dq'  dp  j 


v-  d2H  d 

=ZmF,p,-w{  ^ 


x ,  d2H  dJ£  ^  ( d££  \  d(qi oy/)  dS£ 


t  V  ddj 


=PJ 


dq1 

32 H 
dq'Spj 


dq 1 


-ova.  (3.9) 


Now  we  see  from  (3.7)  and  (3.9)  that  Hamilton’s  equations  for  a  curve  y:  I  — »  T*M 
in  canonical  coordinates  (ql ,pt)  take  the  form 

4V(y(0)  =  f^(7(0)  =4‘(w(?(t)), 

Clt  (j  pi 

^(7(0)  =  -^(7(0)  =  ^(^(7(0)- 

dt  dqi  dq' 

Notice  that  the  first  equation  is  satisfied  if  and  only  if  the  curve  y/  o  y :  I  — )•  TM  is  the 
velocity  vector  field  /  of  the  curve  y  :=  n  c  f :  I  AT  Using  y/o  y  =  /,  the  second 
equation  can  be  written  in  the  form 
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In  view  of  (3.8)  this  is  equivalent  to  the  Euler-Lagrange  equations  for  the  curve  y.  So 
we  have  proven  (ii).  This  also  proves  (i)  by  considering  the  curve  f=(j)oy,  which 
projects  onto  y. 

In  order  to  prove  (iii),  let  us  first  remark  that  for  every  smooth  curve  y  in  M  we 
have  no  (f>  o "/  =  y,  simply  because  (j)  maps  TXM  to  T*M  for  all  x  £  M.  Now  let  y  be 
a  solution  of  Hamilton’s  equation.  Then,  as  shown  above,  \ft  o  y  =  /  is  the  velocity 
vector  of  y  =  n  o  y  and,  hence,  (j)  o  y  =  y.  This  proves  (iii).  □ 

Let  us  summarize  for  completeness  some  interesting  facts,  which  we  have  estab¬ 
lished  in  the  course  of  the  proof. 

Proposition  3.14.  Let  (M be  a  Lagrangian  mechanical  system  with  nice 
Lagrangian.  Then  the  inverse  of  the  dijfeomorphism  (j)  =  (j) %  :  T  M  — >■  T*M  is  given 
by  the  map  I f/  =  y/fj  :  T*M  — >•  TM  defined  by  H  =  E  o  in  equation  (3.6).  Under 
these  diffeomorphisms  the  following  functions  on  TM  and  T*M  are  mapped  to  each 
other:  _ 


TM 

4 

4 

d^/dcf 

E 

T*M 

4 

dH/dpi 

Pi 

H 

Next  we  will  explain  the  relation  of  the  previous  constructions  with  the  notion 
of  Legendre  transform  of  a  smooth  function  /  :  V  — >  R  on  a  finite-dimensional  real 
vector  space  V.  To  define  the  Legendre  transform  we  consider  the  smooth  map 

( pf.V^-V *,  x  i — ^  d fx.  (3.10) 

For  simplicity  we  will  assume  that  (j)f  is  a  diffeomorphism.  (More  generally,  we 
could  consider  the  case  when  <j)f  is  only  locally  a  diffeomorphism.)  Then  we  can 
define  a  new  function  /  :  V*  — ►  IR  by 

/  :=  (/)  ~f)°  Q/1 1  (3.H) 

where  |  is  the  position  vector  field  in  V,  that  is  £v  =  x  for  all  x  £  V .  Evaluating  this 
function  at  y  =  (j>f(x),  we  obtain 

f(y)  =  &(/)-/(*)  =  dfxx-f(x)  =  (y,x)  -f{x), 

where  (y,x)  =  yx  =  y(x)  is  the  duality  pairing.  This  shows  that  (3.11)  can  be  equival¬ 
ently  written  as 

f(y)  =  ((y,*)  -/W)U^-i(y)  •  (3-12) 

Definition  3.15.  The  function  f  :  V*  — ►  IR  is  called  the  Legendre  transform  of 
f:V^R. 

Proposition  3.16.  Let  f  be  a  smooth  function  on  a  finite -dimensional  real  vector 
space  V  such  that  (f>f  :  V  — >■  V*  is  a  dijfeomorphism  and  consider  its  Legendre 
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transform  f  £  C°°(V*).  Then  (j)j  :  V*  — >  V  is  a  cliff eomorphism  and  the  Legendre 
transform  of  f  is  f. 

Proof  See  Appendix  A,  Exercise  24.  □ 

Notice  that,  with  the  above  notations,  for  every  Lagrangian  2z?  £  C°°(TM )  the 
restriction  of  (/)$?  :  TM  — *  T*M,  defined  in  (3.5),  to  TXM,  x  £  M,  is  given  by 

0iHrvM  =  -^x  ■=  A£\txm  ■  TxM  ->  IR. 

Similarly,  for  every  function  H  £  C°°(T*M) 

Vh\t*m  =  0//v,  Hx  ■=  H \t*m  '■  TfM  — >  IR. 

In  view  of  this  relation,  we  will  now  unify  the  notation  and  define  (j>H  :  =  y///. 

Proposition  3.17.  Let  2z?  £  C°°(TM)  be  a  nice  Lagrangian  and  H  £  C°°(T*M) 
the  corresponding  Hamiltonian.  Then  H  is  the  fiber-wise  Legendre  transform  of  2z? 
and  vice  versa,  that  is  Hx  is  the  Legendre  transform  of  L£x  and  f£x  is  the  Legendre 
transform  ofHxfor  all  x  £  M. 

Proof  By  comparing  the  definition  of  the  energy  E  =  ^  (2z?)  —  ££  with  (3.11)  we 
see  that  H x  =  Ex  c  =  L£x  and  this  implies  that  Hx  =  ,i£x  by  Proposition  3.16.  □ 


3.3  Linearization  and  stability 

The  Euler-Lagrange  equations  and  Hamilton’s  equations  are  typically  non-linear.  We 
refer  to  Example  2.7  for  an  illustration  of  this  observation.  Non-linearity  is  also  a  key 
feature  in  many  interesting  physical  applications.  Examples  from  classical  mechanics 
are  the  aerodynamic  drag,  where  the  drag  force  is  proportional  to  the  square  of  the 
velocity,  and  the  Navier-Stokes  equations  describing  the  motion  of  viscous  fluids. 
Important  examples  from  modern  physics  are  Einstein’s  theory  of  general  relativity 
(see  Section  5.4.3)  as  well  as  the  Standard  Model  of  particle  physics. 

However,  non-linear  equations  are  notoriously  difficult  to  solve.  A  way  out  is  to 
consider  a  linear  approximation,  instead.  This  often  yields  valuable  insights  into  the 
true  behavior  of  the  underlying  non-linear  problem  and  serves  as  a  starting  point  for 
more  thorough  studies  (such  as  perturbation  theory). 

This  section  is  based  on  Ref.  [2,  Ch.  5].  In  this  section  we  denote  by  I  C  IR  an 
open  interval. 

Definition  3.18.  A  point  xo  £  IR"  is  called  an  equilibrium  position  of  the  system  of 
ordinary  differential  equations 

dx 

—  =/(x),  (3.13) 

dt 


if  the  constant  cun’e  x(t)  =  xq  is  a  solution. 
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Notice  that  the  equilibrium  positions  of  the  system  (3.13)  are  precisely  the  zeroes 
of/. 

For  the  rest  of  this  section,  we  consider  as  the  prime  example  a  classical  mechan¬ 
ical  system  on  M  =  IR"  with  canonical  local  coordinates  (t/1  ql,...,q")  on 

T M  and  Lagrangian 

%=  \Y*aiM)W -v(<l)  •  (3-14) 

The  functions  ai;(g)  are  chosen  such  that  .£/jn  =  bHaij{qW^  >  0  for  all  qf=0.  The 
motion  is  governed  by  the  the  Euler-Lagrange  equations 


d  d se  d% 

dt  dq'  dq1 


i  =  1 , . . . ,  n  . 


(3.15) 


Proposition  3.19.  The  point  q  =  qo,  q  =  qo  is  an  equilibrium  position  of  the 
Lagrangian  mechanical  system  (3.14)  if  and  only  if  qo  =  0  and  qo  is  a  critical  point 
ofV  (q),  that  is 

=  0  Vi  =  1, . . .,n  .  (3.16) 

?=?0 

Proof  From  Proposition  3.7  we  know  that  the  Euler-Lagrange  equations  (3.15)  can 
be  transformed  into  a  system  of  2 n  first-order  equations  of  the  form 


dV 

dq' 


4  =. 


a‘JPj  =  q' 


Pi  = 


1  _  dau 

2^WPkPr 


dV 

dq' 


where  (q  pi,...,pn)  are  local  coordinates  on  T*M  and  (alJ)  denotes  the 

inverse  matrix  of  (a,y).  At  an  equilibrium  position,  we  have  q  =  p  =  0  and  hence 
from  the  first  equation  q  =  0,  p  =  0.  With  p  =  0  the  first  term  in  the  second  equation 

vanishes  and  hence  we  conclude  that  q  =  qo  is  an  equilibrium  position  if  (3. 16)  holds 

and  only  in  that  case.  □ 

Returning  to  the  general  case,  ^  =  fix),  we  may  Taylor-expand  f(x)  close  to  an 
equilibrium  position  xq.  For  convenience,  we  may  assume  without  loss  of  generality 
xo  =  0  (by  a  translation  of  the  coordinate  system).  The  Taylor  expansion  close  to 
xq  =  0  then  becomes 

f(y)=Ay  +  0(\y\2),  (3.17) 

where  A  =  ^\x=o- 

Definition  3.20.  The  passage  from  the  system 


dx 

dt 


/(•*')  . 


x  :  /  — >■  R”  , 


dy 

dt 


=  Ay, 


y  :  /  — )•  TqIR”  —  R"  , 


to  the  linear  system 
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is  called  linearization  around  the  equilibrium  position  x'o  =  0. 

The  linearized  system  can  be  easily  solved  by 

y(t)  =  eAty( 0) , 

where  eAt  =  tn  +At  +  ^ A2t 2  + ...  is  the  matrix  exponential  series.  For  small  enough 
y,  the  higher-order  corrections  ^?(|y|2)  in  (3.17)  are  small  compared  to  y  itself.  Thus, 
for  a  long  time,  the  solutions  y(t)  of  the  linear  system  and  x(t)  of  the  full  system 
remain  close  to  each  other,  provided  that  the  initial  conditions  y(0)  =  x(0)  are  chosen 
sufficiently  close  to  the  equilibrium  position  xo.  More  precisely,  for  a  given  time 
T  >  0  and  e  >  0,  there  exists  a  8  >  0  such  that  for  any  Xq  €  IR"  with  |xj)  —  xo|  <  5  the 
solution  x(f )  of  (3.13)  with  initial  condition  x(0)  =  x^  exists  (at  least)  for  all  t  £  [0,  T] 
and  compares  to  the  solution  y  :  IR  — >  IR"  of  the  linearized  system  with  y(0)  =  Xq  by 

max  lx  —  y  I  <  £  . 

[0,71 


Consider  the  Lagrangian  mechanical  system  (3.14)  near  the  equilibrium  position 
q  =  qo  and  choose  coordinates  such  that  t/o  =  0. 

Proposition  3.21.  The  linearization  of  (3.14)  near  q  =  qo  is  given  by 


2 


0)qlqJ 


1  y  d2V 

2  "  dq'dql 


q'q]  . 

<1= 0 


(3.18) 


This  is  also  known  as  quadratic  approximation. 

Proof.  The  Hamiltonian  corresponding  to  (3.14)  is  given  by 

H=  ]-Y,alJ(l)plPj  +  V(<l)  ■ 


Hamilton’s  equations  can  be  written  as 

dH  rt  t  ■  i  dH 

Pi  =  -2~i  ,  <i  =  3—  =:  8  (p,q) , 
dq'  dpi 

The  linearization  of  this  system  is  obtained  by  Taylor  expanding  /  and  g  around 
q  =  p  =  0  keeping  only  terms  that  are  at  most  linear  in  p  and  q: 
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dg‘ 

’  dqi 

( daik 

V  <V' 


Pk 


dg' 

<dPj 


d2H 


Pj  = 


dpidqj 


d2H 


'  dpidp: 


Pi 


q1 + E  a'J  (°)Pj  =  L  a‘J  (°)Pj  ■ 


In  the  above  equations  a  vertical  line  is  used  as  a  shorthand  symbol  to  denote  the 
evaluation  of  the  preceding  expression  at  q  =  p  =  0.  Differentiating  and  combining 
the  two  equations  yields 


^  =  L«,;(°)P;  =  -L«U(0) 


d2V 


q 


9=0 


dq'dqk 

These  are  precisely  the  Euler-Lagrange  equations  obtained  from 
Example  3.22.  Consider  the  case  n  =  1 : 

if  =  l-a(q)q2-V{q)  . 

Let  q(t)  =  qo  be  an  equilibrium  position,  that  is 

=  0. 


dV 

dq 


9=90 


Assuming  without  loss  of  generality  qo  =  0,  the  linearized  Lagrangian  becomes 

if2=  l-aq2-^q2  , 


where  a  :=  a( 0)  and  ft  := 
eq.  (3.14)).  The  Euler-Lagrange  equation  corresponding  to  is  given  by 


9=90 


.  Note  that  a  >  0  by  assumption  (cf.  below 


q  =  —COfiq  . 


2  J8 

(Oo  :=  -  • 
a 


(3.19) 


The  motion  crucially  depends  on  the  sign  of  j3  or,  in  other  words,  on  whether  the 
potential  V  (q)  attains  a  local  minimum  or  maximum  at  the  equilibrium  position  qo. 
Indeed,  we  find  as  solution  of  (3.19)  with  integration  constants  c\,C2  G  IR, 


cicos(cuof)+C2sin(cuof)  , 
q(t)  =  ^  cicosh(|®o|r)  +  C2sinh(|®o|0 
c\t  +c2  , 


for  fi  >  0  (“small  oscillations”)  , 
for  P  0  (  runaway  behavior  )  , 
for  fi  =  0  (  “uniform  motion  ”)  . 


This  obserx’ation  leads  to  the  notion  of  stability. 

Now,  we  consider  the  case  of  higher-dimensional  motion  on  IR”,  n  >  1. 
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Definition  3.23.  Consider  a  Lagrangian  mechanical  system  of  the  form  (3.14)  with 
equilibrium  position  q  =  qo  and  set 


mi-= 


d2V 


dq'dql 


i,j= 


9=?0 


The  equilibrium  position  q  =  qo  is  called 

(i)  unstable,  if  £2  is  negative  definite, 

(ii)  stable,  if  £2  is  positive  definite, 

(iii)  degenerate,  if  £2  is  degenerate, 

(iv)  a  saddle  point,  if  £2  is  nondegenerate  but  indefinite. 

This  can  be  understood  as  follows.  Condition  (i)  implies  that  the  potential  V  {q) 
attains  a  local  maximum  at  the  critical  point  qo.  Hence,  any  small  displacement  away 
from  qo  causes  the  system  to  deviate  further  from  it,  since  the  net-force  is  directed 
away  from  qo-  An  example  of  such  a  situation  is  depicted  in  the  following  figure: 


In  case  (ii),  V(q)  attains  a  local  minimum  at  q  =  qo-  For  small  perturbations,  the 
system  stays  close  to  qo  for  all  times,  since  the  forces  are  “restoring  forces.”  This 
can  be  exemplified  as  follows: 


A  typical  example  of  case  (iii)  is  a  region  where  the  potential  V (q)  is  flat: 


In  that  case,  higher-order  derivatives  are  needed  to  decide  stability.  Finally,  condition 
(iv)  can  be  understood  as  a  situation  where  some  directions  are  stable  and  some  are 
unstable. 

For  the  rest  of  this  section,  to  streamline  the  presentation  we  only  consider 
stable  equilibrium  positions.  The  general  case  can  be  analyzed  similarly.  For  one¬ 
dimensional  motion  we  saw  in  Example  3.22  that  the  linearized  motion  around  a 
stable  equilibrium  position  is  oscillatory  in  nature, 

q(t)  =  c\  cos(a>ot)  +C2sin(G)()f)  . 

Here,  ©o  =  \/fi /a  is  the  frequency,  and  To  =  2k/ oto  the  period  of  the  oscillation. 
This  leads  us  to  the  next  concept,  namely  small  oscillations. 

Definition  3.24.  Motions  in  a  linearized  system  £6 \ ,  as  defined  in  (3.18),  are  called 
small  oscillations  near  an  equilibrium  position  q  =  qo- 
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Recall  that  the  linearized  Lagrangian  is  given  by 

■^2  =  \  £  OijqlqJ  -  ^  £  Qjq'q3  ■ 

.  Note  that  (ttjj)  and  Q  \=  (Qj)  are  sym¬ 
metric,  real  matrices  and  (a,y)  is  positive  definite.  It  is  useful  to  work  in  coordinates 
where  O',-,  =  5,;-,  where  8,j  is  the  Kronecker  delta. 

The  equations  of  motion  following  from  £6 \  are  a  set  of  a  priori  coupled  linear 
ordinary  differential  equations. 


where  a,j  :=  ai;( 0)  and  f2(/-  := 


d2V 

dq'dqJ 


ij'- 

Here,  the  Kronecker  delta  8n  (note  that  8IJ  =  5,;)  is  written  explicitly  in  order 
to  make  it  manifest  that  the  index  positions  on  both  sides  of  the  equation  match. 
By  a  suitable  choice  of  coordinates,  we  can  de-couple  the  ordinary  differential 
equations.  That  is,  we  diagonalize  the  real  symmetric  matrix  Q  using  an  orthogonal 
transformation  Mg  0(n ),  so  that 

M1  QM  =  diag(Ai, . . . , A,,)  , 

where  Ai , . . . ,  X„  S  IR  are  the  eigenvalues  of  Q  and  (-)T  denotes  matrix  transposition. 
Now,  we  define  new  coordinates  Q  on  IR"  as 


Q  =  MJq  . 


In  terms  of  the  new  coordinates,  the  equations  of  motion  become 
Q'  =  -XiQ'  (no  sum  over  i) . 

The  corresponding  Lagrangian  is  given  by 

-%  =  k(2')2-jl  Mof. 

z.  i  z.  i 

up  to  an  additive  constant.  The  de-coupled  system  of  equations  can  be  solved  straight¬ 
forwardly.  One  finds  n  independent  harmonic  oscillators  of  the  form 

Q‘(t )  =  c'j  cos  (^\Z%t'j  -fc^sin  (^y/Xjt'j  (no  sum  over  i) , 

with  real  integration  constants  (c'j  2);=i,...,n-  Recall  that  the  stability  criterion  (see 
Definition  3.23)  guarantees  A,  >  0  for  all  i.  Otherwise  the  linearized  motion  will 
in  general  not  be  periodic,  but  solutions  can  be  described  in  a  similar  way  using 
hyperbolic  and  linear  functions  for  the  cases  A,-  <0  and  A,  =  0,  respectively. 

What  we  have  just  learned  is  that  a  system  performing  small  oscillations  de¬ 
composes  into  a  direct  product  of  n  one-dimensional  systems  performing  small 
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oscillations.  In  particular,  the  system  can  perform  small  oscillations  of  the  form 

q(t)  =  [cicos(of)H-C2sin(G)f)]4  ,  (3.20) 

where  ft)  =  vX  and  <q  is  an  eigenvector  of  £2  corresponding  to  A,  that  is 

£2E,  =  XE,  . 

This  oscillation  can  be  regarded  as  a  product  of  Q'  =  c\  cos  (ft),/)  +  c‘2  sin(  ft),f )  and 
Q}  =  0,  j  f  i,  for  some  i  £  { 1 .  The  two-parameter  family  of  solutions  (3.20) 
is  called  a  characteristic  oscillation  or  eigen-oscillation  or  normal  mode  and  ft)  =  s/ X 
is  called  characteristic  frequency  or  eigen-frequency  (or  sometimes  also  resonance 
frequency).  Sometimes  also  an  element  of  that  family  is  called  a  characteristic 
oscillation.  The  vector  tq  is  called  the  eigenvector  corresponding  to  the  characteristic 
oscillation  and  a  system  of  characteristic  oscillation  is  called  independent  if  the 
corresponding  eigenvectors  are  linearly  independent. 

The  above  results  are  summarized  in  the  following  theorem. 

Theorem  3.25.  The  linearized  Lagrangian  mechanical  system  of  the  form  (3.18) 
near  a  stable  equilibrium  position  q  =  qo  performs  small  oscillations  given  by  a 
sum  of  characteristic  oscillations.  The  system  has  n  independent  characteristic 
oscillations  and  the  characteristic  frequencies  are  the  square  roots  of  the  eigenvalues 
of  the  Hessian  matrix  of  the  potential  V  (q)  at  qo  (assuming  that  CCjj  =  8jj,  without 
loss  of  generality). 

Note  that  a  sum  of  characteristic  oscillations  is  generally  not  periodic} 

The  linearized  Lagrangian  mechanical  system  ££%  can  now  be  solved  in  the 
following  way: 

(i)  Find  the  complex  characteristic  oscillations  of  the  form  q(t)  =  elCOt tq  by  substitut¬ 
ing  into  the  equations  of  motion  q  =  -Qq.  This  yields  a  characteristic  equation, 
£2q  =  ofq .  Solving  this  equation  produces  n  pairwise  orthogonal  eigenvectors 
Ik  with  corresponding  real  eigenvalues  A/;  =  ft)p. 

(ii)  The  general  real-valued  solution  is  a  linear  combination  of  (i).  That  is, 

9(f)  =  Re£c^+^§t, 

k= t 


with  and  8f.  real  parameters. 

Remark.  This  result  is  valid  irrespective  of  the  multiplicities  of  the  eigenvalues  X  . 

Example  3.26  (See  [2,  Ch.  5]  for  this  and  further  examples).  Consider  two  identical 
mathematical  pendula  of  unit  mass  connected  by  a  weightless  spring  as  depicted  in 
the  following  figure: 


1  For  example,  consider  a  case  with  two  eigenvalues  X  =  1,  X  =  2,  and  the  solution  q(t)  = 
sin(f)<fji  +sin(-\/2f)X. 
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For  small  oscillations,  we  have  E^m  =  \q\  +  \q2  and  E^ot  =V  =  \q\  +  \q\  +  \{q\  — 
q2)2,  where  the  last  term  in  V  is  due  to  the  spring.  We  choose  new  diagonalizing 
coordinates 


Qi 


qi+qi 

V2 


Qi 


q\-qi 

V2 


In  terms  of  the  new  coordinates,  the  linearized  Lagrangian  becomes 


■%2  —  ^kin  ^  £pot  —  ^  Q 1  +  ^  0-2  —  ^  Q\  ~  ^  ®2  Ql  i 


with  (Q\  =  1  and  txh 
(a)  Qz  =  0,  that  is  q\ 


s/1+2 k.  There  are  two  characteristic  oscillations,  namely 


This  is  the  case  where  both  pendula  move  in  phase  and  the  spring  has  no  effect, 
(b)  Qi  =  0,  that  is  q\  =  —q2- 


In  this  case,  the  pendula  move  in  opposite  phase  with  increased  frequency  ©2  >  1 
due  to  the  presence  of  the  spring. 


Chapter  4 

Hamilton- Jacobi  theory 


Abstract  Besides  the  Newtonian,  Lagrangian,  and  Hamiltonian  formulations  of 
classical  mechanics,  there  is  yet  a  fourth  approach,  known  as  Hamilton- Jacobi  theory, 
which  is  part  of  Hamiltonian  mechanics.  This  approach  is  the  subject  of  the  present 
chapter.  In  Hamilton- Jacobi  theory,  the  central  equation  capturing  the  dynamics  of  the 
mechanical  system  is  the  Hamilton- Jacobi  equation  which  is  a  first-order,  non-linear 
partial  differential  equation.  Remarkably  and  contrary  to  the  other  formulations  of 
classical  mechanics,  the  entire  multi-dimensional  dynamics  is  described  by  a  single 
equation.  Even  for  relatively  simple  mechanical  systems,  the  corresponding  Hamilton- 
Jacobi  equation  can  be  hard  or  even  impossible  to  solve  analytically.  However,  its 
virtue  lies  in  the  fact  that  it  offers  a  useful,  alternative  way  of  identifying  conserved 
quantities  even  in  cases  where  the  Hamilton- Jacobi  equation  itself  cannot  be  solved 
directly.  In  addition,  Hamilton- Jacobi  theory  has  played  an  important  historical  role 
in  the  development  of  quantum  mechanics,  since  the  Hamilton-Jacobi  equation  can 
be  viewed  as  a  precursor  to  the  Schrodinger  equation  [3,  7,  18]. 


For  simplicity  we  consider  Hamiltonian  systems  ( T*M,H )  of  cotangent  type,  that  is 
for  which  the  phase  space  is  the  cotangent  bundle  T*M  of  a  (connected)  manifold  with 
canonical  symplectic  form  (0  =  ft)can.  As  we  have  shown  in  Chapter  3,  this  includes 
the  systems  obtained  from  Lagrangian  mechanical  systems  with  a  nice  Lagrangian 
via  Legendre  transform.  By  Darboux’s  theorem  every  Hamiltonian  system  can  be 
locally  identified  with  an  open  subset  of  a  Hamiltonian  system  of  cotangent  type. 

We  will  now  describe  a  method  for  the  solution  of  Hamilton’s  equation  which  is 
based  on  the  following  simple  idea.  Consider  a  function  S  on  the  base  manifold  M.  It 
gives  rise  to  a  1-form  dS  :  M  — >■  T*M  and  for  every  curve  y :  I  — >  M  we  can  consider 
the  curve 

y  \  I  — y  T  M ,  1 1 — >  dSy  ^ . 
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We  would  like  to  know  when  y  is  an  integral  curve  of  the  Hamiltonian  vector  field 
Xfj.  Since  H  is  an  integral  of  motion,  a  necessary  condition  is  that  Ho  y  is  constant. 
Projecting  Hamilton’s  equation  f  =  A7/(y)  to  M  we  obtain 

y  =  dnXH(f).  (4.1) 

We  will  prove  in  the  next  theorem  that,  conversely,  this  equation  (for  n  =  dim M 
functions)  is  sufficient  to  solve  Hamilton’s  equation  (for  2 n  functions)  if  we  assume 
that  H  is  constant  on  the  section  dS  of  T*M.  Notice  that  in  canonical  coordinates 
( q\pi )  associated  with  local  coordinates  (x')  in  M  the  equation  (4.1)  corresponds  to 
the  system 

<?  H  . 

q  =  (yj,  l  =  1 

a  Pi 

where  q'(t)  =  q'(f{t))  =  x'(y(f)).  This  is  exactly  half  (that  is,  n  out  of  2 n)  of 
Hamilton’s  equations. 

Theorem  4.1.  Let  ( T*M,H )  be  a  Hamiltonian  system  of  cotangent  type  and  S  £ 
C°°(M).  Then  the  following  are  equivalent. 

1.  For  all  solutions  y  of  (4.1),  the  cur\’e  y  =  dSy  is  a  solution  of  Hamilton’s  equation. 

2.  The  Hamiltonian  vector  field  Xh  is  tangent  to  the  image  of  dS  :  M  — ¥  T*M. 

3.  The  function  H  o  dS  :  M  — >  IR  is  constant. 

To  prove  this  theorem  we  will  use  the  following  fundamental  lemma. 

Lemma  4.2.  Let  a  be  a  smooth  l-form  on  M.  We  consider  it  as  a  smooth  map 

(pa:M^r  T*M. 


Then  the  following  holds: 

(i)  The  pull  back  of  the  Liouville  form  A  under  this  map  is  given  by 

(p*af  =  a. 

(ii)  The  pull-back  of  the  canonical  symplectic  form  CO  is 

(Pa<o  =  da. 


Proof,  (i)  For  v  £TM,  we  compute 

(<p*A)(v)  =  A  {dcpav)  =  a(dndcpav)  =  a(v). 
Part  (ii)  follows  from  (i): 


(Pa(0  =  cp*adX  =  dcp*aX  =  da. 


□ 
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Definition  4.3.  An  immersion  cp  :  N  — >  M  of  an  n-dimensional  manifold  N  into 
a  2n-dimensional  symplectic  manifold  ( M ,  co)  is  called  Lagrangian  if  it  satisfies 
(p*co  =  0. 

Lemma  4.2  (ii)  immediately  implies: 

Corollary  4.4.  The  embedding  (pa  :  M  — >  T*M  defined  by  a  smooth  l -form  a  on  a 
manifold  M  is  Lagrangian  if  and  only  if  a  is  closed. 

Definition  4.5.  Let  (V,  co)  be  a  symplectic  vector  space.  A  subspace  U  CV  is  called 
Lagrangian  ifU  coincides  with 

U±  =  {v  £  V  |  C0(v,u)  =  0  for  all  u£U}. 

It  is  easy  to  prove  the  following  proposition  (see  Appendix  A,  Exercise  25). 

Proposition  4.6.  An  immersion  (p  :  N  —¥  M  of  an  n-dimensional  manifold  N  into  a 
2n-dimensional  symplectic  manifold  ( M ,  co)  is  Lagrangian  if  and  only  if  its  tangent 
spaces  dcpTxN  C  are  Lagrangian  for  all  x  £  N. 

Proof,  (of  Theorem  4.1)  Let  us  first  show  that  1.  and  2.  are  equivalent.  We  first 
assume  1.  and  show  that  Xh{cp(is(x))  is  tangent  to  the  Lagrangian  submanifold 
N  =  cptis(M)  C  T*M  for  all  x  £  M.  Let  y :  I  =  (—£,£)  — >  M,  £  >  0,  be  a  solution 
of  (4.1)  with  initial  condition  y(0)  =  x.  Then  the  curve  y  =  (p,/s  o  y  is  a  solution  to 
Hamilton’s  equation  and  lies  in  N.  Therefore, 

XH(  j)  =  i  £  TN 

and,  in  particular,  Xn{cpds{x))  =  Xh(y( 0))  =  y'(O)  €  TN.  So  we  have  proven  that 
1 .  implies  2.  To  show  the  converse,  we  observe  that  given  a  solution  y :  7  — >  M  of 
(4.1)  and  t  £  L  the  vector  f  (t)  £  Ty^N  is  the  unique  tangent  vector  in  Ty^N  which 
projects  to  /(f)  =  r/^A//(y(f)).  Since  the  vector  2(y(y(f))  £  T^,^T*M  projects  to  the 
same  vector  in  the  base,  we  see  that  /(f)  =  A//(y(f))  if  and  only  if  X//(y(f ))  £TN. 

Next  we  prove  the  equivalence  of  2.  and  3.  Since  N  C  T*M  is  Lagrangian,  the 
vector  field  Xh  is  tangent  to  N  if  and  only  if  at  all  points  u  £  N  we  have 

XH(u)  £  (7’„A)±  dHT„N  =  0  4=>  d{Ho  cpds)\n[u)  =  0. 


□ 


The  partial  differential  equation 

H  odS  =  const 

of  part  3  of  Theorem  4.1  is  called  the  Hamilton-Jacobi  equation.  We  can  summarize 
part  of  Theorem  4. 1  as  follows.  If  S  is  a  solution  to  the  Hamilton-Jacobi  equation  then 
every  integral  curve  of  Xh  passing  through  a  point  of  the  Lagrangian  submanifold 
N  =  cp(is{M)  C  T*M  is  fully  contained  in  N  and  these  integral  curves  are  obtained 
by  solving  a  system  of  n  ordinary  differential  equations  (rather  than  2 n). 
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Next  we  will  show  that  given  not  only  one  solution,  but  a  smooth  family  of 
solutions  of  the  Hamilton-Jacobi  equation  depending  on  n  parameters  we  can  (at 
least  locally)  completely  solve  Hamilton’s  equations  of  motion,  provided  the  family 
is  non-degenerate  in  a  certain  sense.  The  non-degeneracy  assumption  will  ensure  that 
the  family  cannot  be  locally  reduced  to  a  family  depending  on  fewer  parameters. 

Definition  4.7.  Let  ( T*M,H )  be  a  Hamiltonian  system  of  cotangent  type  and 
n  =  dim M.  A  family  of  solutions  of  the  Hamilton-Jacobi  equation  is  a  smooth 
function 

S'.MxU  — x  R,  (x,  u)  i— x  S(x,u), 

where  TJ  C  R"  is  an  open  subset,  such  that  for  all  u  £  U  the  function  x  K X  Su(x)  := 
S(x,  u)  on  M  is  a  solution  of  the  Hamilton-Jacobi  equation.  It  is  called  non-degenerate 
if  the  map 


$s  :  M  x  1/  -t  T*M,  (x,  u)  i-x  dSu  \x  £  T*M  CT*M 
is  of  maximal  rank. 

Let  ( Su)ueu  be  a  smooth  family  of  solutions  of  the  Hamilton-Jacobi  equation. 
Then,  for  all  u  £  U,  the  image  of  dS"  :  M  — x  T*M  is  a  Lagrangian  submanifold  Nu. 
The  smooth  map  <P$  maps  each  submanifold  M  x  {u}  CM  xU  diffeomorphically  to 
the  submanifold  Nu  C  I  '  M.  Moreover,  it  is  fiber  preserving  in  the  sense  that  it  maps 
the  fibers  {jc}  xU,  x  £  M,  of  the  trivial  projection  M  x  U  — X  M  into  the  fibers  T*M 
of  the  cotangent  bundle.  The  next  proposition  is  left  as  an  exercise  (see  Appendix  A, 
Exercise  26). 

Proposition  4.8.  Let  ( T*M,H )  be  a  Hamiltonian  system  of  cotangent  type,  n  = 
dimM  and  let  S  :  M  x  JJ  —X  R  be  a  smooth  n-parameter  family  of  solutions  of  the 
Hamilton-Jacobi  equation.  Then  the  following  conditions  are  equivalent. 

(i)  The  family  is  non-degenerate. 

(ii)  <Ps  '■  M  x  U  — X  T*M  is  a  local  dijfeomorphism. 

( Hi )  For  all  x  £  M, 


<Ps\{x}ru-{x}xU  =  U^T;M.  u  i— x  dSu\x, 

is  a  local  dijfeomorphism. 

(iv)  For  all  x  £  M, 

®s\{x}xu  :  {x}  x  U  “  U  ->  TX*M, 

is  of  maximal  rank. 

(v)  For  all  (x,u)  £  M  xU  the  n  x  n-matrix 

(  d2S(x,u)  \ 

\  dx'dul  ) 


is  invertible,  where  (x‘)  are  local  coordinates  in  a  neighborhood  of  x  £  M  and  {u') 
are  (for  instance)  standard  coordinates  in  U  C  R”. 
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Lemma  4.9.  Let  (T*M,H)  be  a  Hamiltonian  system  of  cotangent  type,  n  =  dimM, 
and  S  :  M  xU  R  a  smooth  non-degenerate  n-parameter  family  of  solutions  of  the 
Hamilton-Jacobi  equation.  Then  the  pull  back  of  the  canonical  symplectic  form  by 
the  local  dijfeomorphism  :  M  x  [/  ->  T*M  is  given  by 


d2S 

<Ptco  =  ~=r~- ^ — -dx?  A  duf 

^  dx'duJ 


with  respect  to  any  local  coordinate  system  (x1, . . .  ,x”)  on  M,  where  (u1 ,  . . .  ,«")  are 
standard  coordinates  on  U  C  IR”. 


Proof  Let  us  denote  by  (ql ,p/)  the  local  coordinates  on  T*M  associated  with  the 
local  coordinates  (x1)  on  M.  Then 


q'o(ps  =xf  p,  o  <ps 


dl 

dx' 


and,  hence. 


dS 


dfl  =  £  g~-dx‘  =4-  (PS at  =  d&sX  =  Y. 


d2S 

dx'did 


dx1  Aduf 


□ 

Theorem  4.10.  Let  (T'M .  II)  be  a  Hamiltonian  system  of  cotangent  type,  where  M 
is  connected,  n  =  dimM  and  S  :  M  x  U  — >  IR  a  smooth  non-degenerate  n-parameter 
family  of  solutions  of  the  Hamilton-Jacobi  equation .  Then  the  motions  of  the  Hamilto¬ 
nian  system  ( T*M,H )  contained  in  the  open  subset  <Pg(M  xU)  C  T'  M  are  of  the 
form  <L>s  °  7.  where  J  :  I  — X  M  x  U  is  a  motion  of  the  Hamiltonian  system 

(. MxU ,  ft)  :=  0s  ft),  H:=Ho<Ps). 


The  equations  of  motion  of  the  latter  system  can  be  solved  by  passing  to  local 
coordinates  of  the  form 

( ~i  i  ~  dS 

[q  :=U'Pi:=-W 

In  such  coordinates  we  have 


ft)  =  Ydq'  L\dpi, 


The  function  H  =  H(q1,...,qn)  depends  only  on  the  q'  and  not  on  pi,  and  the 
solutions  are  given  by 


q\t)=a\  pi{t) 


dH  j 


')dt. 


where  a1  are  arbitrary  real  constants. 
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Proof.  Since  <P^co  =  ft)  and  0(7/  =  H  it  is  clear  that  <J >s  maps  motions  of  (M  x 
U,  ft) .  H )  to  motions  of  ( T*M1H ).  In  this  way  we  obtain  all  motions  lying  in  the  image 

of  <l>s ■  Due  to  the  non-degeneracy  of  the  matrix  (  ^dx^d’tP  )  ’  t^le  functi°ns  {PtiPi)  f°rm 
a  system  of  local  coordinates,  when  restricted  to  an  appropriate  neighborhood  of  any 
given  point  of  M  x  U .  We  compute 


^dq1  Adpi  =  V  dul  Ad 


d2S 

dxldu> 


dx'  A  duJ . 


By  Lemma  4.9,  this  is  ft).  In  virtue  of  the  Hamilton-Jacobi  equation,  for  every  u  €  U, 
the  function  x  i — ^  H (jc,  u)  =  H  o  <Ps(x,u)  =  Ho  dSu\x  is  constant,  that  is,  it  depends 
only  on  it.  In  the  local  coordinates  (ql . p, )  this  means  that  H  =  // (q)  is  a  function 
solely  of  the  q'.  So  Hamilton’s  equations  reduce  to 


dH 

9=  0,  Pi  =  ~J~iW> 


which  are  trivially  solved  as  indicated. 


□ 


Next  we  discuss  an  example,  cf.  [2,  Ch.  9,  Sec.  47]  and  references  therein.  Con¬ 
sider  a  particle  of  unit  mass  moving  in  the  Euclidean  plane  under  the  gravitational 
potential  generated  by  two  equal  masses  placed  at  distance  2c  >  0,  say  at  the  points 
fi  =  (c,0)  and  /2  =  (— c,  0).  This  problem  can  be  conveniently  studied  by  consid¬ 
ering  the  distances  r\  and  ro  to  the  points  f\  and  ff  respectively.  The  gravitational 
potential  is 

V  k  k 
n  r2 

for  some  constant  k  >  0.  Away  from  the  Cartesian  coordinate  axes  one  can  use  the 
globally  defined  functions 


%  :=  n  +r2,  ri:=rl-r2, 

as  coordinates.  The  level  sets  of  E,  and  \rj  \  form  a  confocal  system  of  ellipses  and 
hyperbolas,  that  is  a  system  with  common  focal  points  From  the  geometry  of 

conic  sections  we  know  that  the  tangent  line  at  a  point  q  of  any  ellipse  or  hyperbola 
bisects  the  angle  between  the  two  lines  connecting  q  with  the  foci.  This  implies  that 
the  ellipses  and  hyperbolas  of  the  confocal  system  intersect  orthogonally.  Therefore 
the  Euclidean  metric  is  a  linear  combination  of  dtp  and  d J] 2  (with  functions  as 
coefficients). 

Proposition  4.11.  In  the  coordinates  (g .  1]  )  the  Euclidean  metric  takes  the  form 

<^-tr  je2l  ¥-ri2  Jn2 
8  4(£2-4c2)^  +4(4c2-T72)^  ' 

Proof.  It  is  sufficient  to  check  the  formula  on  the  ellipses  and  hyperbolas  of  the 
confocal  system.  The  ellipse  can  be  parametrized  as  y(f)  =  (acost.hsint),  where 
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r\  =  (acost  —  c)2  +  b2sm2t  =  a2cos2t  —  2accost  +  c2  +  b2sm2t 
=  c2  cos2 1  —  2accost +  a2  =  (a  —  ccost)2 


and,  thus, 


r\=a  —  ccosf,  r2=a  +  ccost,  E,=2a ,  77^  —  2ccosf. 

This  implies  4c2  —  77 2  =  4c2  sin2 1  =  77 2  and,  hence, 

£(/,/)  =  4(4C2  ^2,  =  ^2-j72)  =a2~c2cos2t  =  a2 sin2 1  +  b2 cos2 1. 

The  result  coincides  with  gCan(Y,Y ),  the  square  of  the  Euclidian  length  of  /  = 
(— asmt,bcost).  The  analogous  calculation  for  the  hyperbola  is  left  to  the  reader 
(see  Appendix  A,  Exercise  31).  This  proves  that  gcan  =  g  on  the  domain  of  definition 
of  the  coordinates  (£,rj).  □ 

Corollary  4.12.  In  the  coordinates  (c .  77 ),  the  Hamiltonian  of  a  particle  of  unit 
mass  moving  in  the  Euclidean  plane  under  the  gravitational  potential  generated  by 
two  equal  masses  placed  at  f\  =  (c,0)  and  f2  =  (— c,0)  has  the  form 


rr  o  2  2  4c2  0  24c2-1?2  4fc| 

P^2_tj2  +ZPri  ^2_jj2  £2  _  ^2  ' 


(4.2) 


Proof  For  any  local  coordinate  system  (x1  ,x2)  we  have 


H(q,p)  =  lYjS'J((l)PiPj  +  v(cl)^ 


where  (q1  ,pi)  are  the  corresponding  local  coordinates  on  the  cotangent  bundle  of  R2. 
Inverting  the  diagonal  matrix  representing  the  metric  in  the  coordinates  {E, ,  Tj )  and 
dividing  by  2  yields  the  kinetic  term  in  (4.2).  The  potential  term  is 

V  -----  -  Kr2  +  r\)  _  4 k% 

r\  r2  r\r2  £2-V2' 


□ 

In  the  coordinates  (E, ,  77),  the  Hamilton-Jacobi  equation  H  (q,  ||  j  =  C  =  const 
takes  the  form 

2  (If)  2  ^  -4c2)  +2  (f^)  2  (4c2  ~  n2)  -4*4  =  C(£2  -  T72). 

The  variables  (£,pg)  and  {t),Pt\)  can  be  separated  as  follows.  We  look  for  solutions 
of  the  system 
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where  c  \ .  c?  are  constants.  Solving  each  equation  for  the  partial  derivatives  dS/di \ 
and  dS/dri,  respectively,  and  integrating  yields  the  following  two-parameter  family 
of  solutions  of  the  Hamilton-Jacobi  equation 


Remark.  Hamilton-Jacobi  theory  can  be  extended  to  the  case  of  time-dependent 
Hamiltonians  H{q1p,t).  The  corresponding  generalization  of  the  Hamilton-Jacobi 
equation  has  the  form 


(4.3) 


where  S  =  S(q,t )  is  now  allowed  to  depend  on  time  t.  The  choice  of  the  letter  S  is 
related  to  the  action  functional  of  a  Lagrangian  mechanical  system  as  defined  in 
Definition  2.4.  Let  us  very  briefly  explain  this  relation.  Let  be  a  Lagrangian 

mechanical  system  with  possibly  time-dependent  Lagrangian.  Fix  a  point  xo  £  M  and 
a  time  to-  Suppose  that  for  (x,t)  £  M  x  IR  in  a  suitable  domain  there  exists  a  unique 
motion  s  i-4  y(,v)  (inside  a  suitable  domain  in  M)  such  that  y(fo)  =  xo  and  y(f)  =  x. 
Then  we  can  define 


Under  appropriate  assumptions,  it  can  be  shown  that  S(X(ht())  is  a  smooth  family  of 
solutions  of  the  time-dependent  Hamilton-Jacobi  equation  (4.3),  see  [2,  Ch.  9,  Sec. 
46], 


Chapter  5 

Classical  field  theory 


Abstract  A  second  cornerstone  of  classical  physics  besides  point-particle  mech¬ 
anics  is  field  theory.  Classical  field  theory  is  essentially  an  infinite  collection  of 
mechanical  systems  (one  at  each  point  in  space)  and  hence  can  be  viewed  as  an 
infinite-dimensional  generalization  of  classical  mechanics.  More  precisely,  solutions 
of  classical  mechanical  systems  are  smooth  curves  1 K »•  7(f)  from  IR  to  M.  In  classical 
field  theory,  curves  from  IR  are  replaced  by  maps  from  a  higher-dimensional  source 
manifold.  In  this  more  general  framework  we  also  allow  for  Lagrangians  with  ex¬ 
plicit  time  dependence.  Another  key  feature  of  classical  field  theory  is  its  manifest 
incorporation  of  the  laws  of  Einstein’s  theory  of  relativity.  This  chapter  begins  with 
definitions  and  properties  of  the  central  objects,  namely  fields,  Lagrangians,  action 
functionals,  and  the  field-theoretic  version  of  the  Euler-Lagrange  equations.  Modem 
covariant  field  theory  is  customarily  formulated  in  the  language  of  jet  bundles,  which 
is  also  utilized  and  thus  introduced  here.  We  study  symmetries  and  conservation 
laws  of  classical  field  theories  in  the  second  part  of  this  chapter,  which  culminates  in 
the  field-theoretic  version  of  Noether’s  theorem.  The  penultimate  section  is  devoted 
to  a  thorough  presentation,  from  a  mathematical  perspective,  of  some  prominent 
examples  of  classical  field  theories,  such  as  sigma  models,  Yang-Mills  theory,  and 
Einstein’s  theory  of  gravity.  A  key  ingredient  of  matter-coupled  Einstein  gravity  is 
the  energy-momentum  tensor,  which  is  studied  in  detail  in  the  final  section.  This 
chapter  is  largely  based  on  Ref.  [16]. 


5.1  The  Lagrangian,  the  action  and  the  Euler-Lagrange 
equations 

The  fields  of  a  classical  field  theory  are  basically  smooth  maps  /  :  6  — ►  &  from  a 
source  manifold  ©  (possibly  with  boundary)  to  a  target  manifold  & .  The  theory  is 
defined  by  an  action  functional  of  the  form 


45 


46 


5  Classical  field  theory 


S\f]  =  [  J?(jk(f))dvol,  (5.1) 

Je 

where  dvol  is  a  fixed  volume  element  on  ©  and  the  Lagrangian  is  a  smooth 
function  on  a  certain  bundle  JetA(©,  £?)  over  ©.  The  elements  of  the  fiber  Jet^(©, 
over  x  £  ©  are  defined  as  equivalence  classes  of  smooth  maps  ©  — >  S? ,  where  two 
maps  are  equivalent  if  their  Taylor  expansions  with  respect  to  some  local  coordinate 
system  coincide  up  to  order  k  at  x.  This  condition  does  not  depend  on  the  particular 
choice  of  local  coordinates.  The  equivalence  class  of  /  with  respect  to  the  above 
relation  is  denoted  by  jkx  (/)  £  Jet^f©.  rf)  and  is  called  the  k-th  order  jet  of  /  at  x. 
The  map 

/(/):©->  Jeti(6,^),  x^jkx(f), 

is  a  smooth  section  of  the  jet  bundle  Jet*(©,  fif). 

In  many  physically  interesting  cases  the  source  manifold  ©  is  interpreted  as  a 
space-time  of  a  certain  dimension.  The  Lagrangian  mechanical  systems  discussed  so 
far  can  be  considered  as  classical  field  theories  of  order  k=  1  with  one-dimensional 
space-time.  The  source  ©  =  I  is  an  interval  and  the  target  manifold  £?  is  the  config¬ 
uration  space  of  the  mechanical  system. 

For  simplicity,  we  will  assume  for  the  moment  that  ©  is  either  a  bounded  domain 
Q.  C  IR"  with  smooth  boundary  or  Q  =  R",  and  that  &  =  km.  (Recall  that  a  domain 
is  a  connected  open  set.)  The  basic  ideas  can  be  explained  with  almost  no  loss  of 
generality  in  this  setting,  cf.  [16].  For  the  convergence  of  the  action  integral  one  needs 
to  require  certain  boundary  conditions  on  /,  depending  on  the  particular  problem 
under  consideration. 

Using  standard  coordinates  (x1,.  ••>*”)  on  12  C  R"  and  (y,...,ym)  on  Rm  we 
can  trivialize  the  bundle  Jeti(12,Rm)  and  so  identify  it  with  12  x  V,  where  V  = 
.let^fR”,  Rm)  is  the  vector  space  consisting  of  vector- valued  Taylor  polynomials 
of  order  k.  It  has  natural  global  coordinates  denoted  by  uL\,  where  a  =  1 . . . . .  m 
and  I  =  (t'l , . . . ,  i()  £  [  1 '  runs  through  all  unordered  multi-indices  of  length 
i<k.  So  multi-indices  which  differ  only  by  a  permutation  are  not  distinguished. 
The  dimension  of  V  is  For  any  smooth  map  /  :  Q  E'"  and  multi-index 

I  =  (i| , . . . .  if  )  of  length  t<k  we  have 

uaIu\f))=difa=dil---dkr, 

where  <9;  =  dxi  and  fa  =  ya  o  /,  a  =  1 , . . . ,  m.  So  the  k-th  jet  of  /  at  some  point  x  £  i2 
is  simply  given  x  together  with  the  partial  derivatives  up  to  order  k  of  the  components 
fa  of  /  at  x. 

Example  5.1.  Consider  the  case  n  =  3,  m  =  1  and  k  =  2.  With  obvious  notational 
simplifications,  the  natural  coordinates  ofV  =  Jet^R”,  R)  are 

U,Ul,U2,U3,Uu,Ul2,Ul3,U22,U23,U33 


and  dimV  =  10  =  Q- 
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In  order  to  derive  the  equations  of  motions  of  a  classical  field  theory  it  is  useful  to 
introduce  the  total  derivative  Dj  =  Dxi  which  is  defined  by 


Dj  —  dj  + 


E 


4I,i 


d 

duaj 


such  that,  by  the  chain  rule, 

d  5^ 

dtm/f))  =  («(//  )+Yj[dun—{jkf)  =  (A-^x/y). 

I  O'M/ 

For  a  multi-index  I  =  if  )  of  length  |/|  :=  £  we  define 


D i  =  Dit  ■  ■  ■  Die 


and 

(-£))/  ={-Dil)-(-Dll)  =  (-l)tDI. 

Theorem  5.2.  Let  f£  be  a  smooth  function  on  Jet^ifi .  IR'")  and  consider  the 
corresponding  action  functional  (with  respect  to  the  canonical  volume  form).  A 
smooth  map  f  :  Q.  -4  IRm  is  a  critical  point  of  the  action  under  smooth  variations 
with  compact  support  in  Q  if  and  only  if  it  has  finite  action  and  is  a  solution  of  the 
following  system  of  partial  differential  equations  of  order  <  k: 

d//> 

a„  :=  ^  (-D)/  — (//)  =°,  a  =  l,...,m.  (5.2) 

|/|<*  °ui 


Definition  5.3.  The  equations  (5.2)  are  called  the  Euler-Lagrange  equations  or 
equations  of  motion  of  the  classical  field  theory  defined  by  the  Lagrangian  ££ .  A 
solution  of  the  classical  field  theory  is,  by  definition,  a  solution  of  its  equations  of 
motion  ( irrespective  of  whether  it  has  finite  action  or  not). 


In  order  to  compare  to  the  Euler-Lagrange  equations  in  mechanics  we  observe 


that 


L(-d) 

\I\<k 


d 

1 du " 


d 

dua 


+  E  (-D) 

i<i/i<* 


d 

1  du"  ' 


So  for  a  first  order  Lagrangian  we  obtain 


_d_  y  _d_ 
dua  E  '<5m" 


In  the  case  n  =  1  these  are  precisely  the  Euler-Lagrange  equations  of  classical 
mechanics  if  we  denote  m"  =  qa,  u"  =  qa,  x1  =  t,  and  D\  = 

Proof,  (of  Theorem  5.2)  A  smooth  map  /  with  finite  action  is  a  critical  point  of  the 
action  if  and  only  if  for  all  variations  h  with  compact  support  we  have 
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S\f  +  eh]  =  0. 

6=0 


We  compute 


d 

de 


e=0 


*(/a  +  e*))=- 


£=0 

ajs? 


^(/(/) +e/(/ o) = E  tt(//k(/w) 


Applying  the  divergence  theorem  to  if  <9/2  7^  0,  or  to  the  boundary  of  an  open 
ball  containing  the  support  of  /;  if  =  IR",  we  obtain 


d 

de 


S\f + eh] 
e=o 


E/fl  sf  (sf  <*») 

E^  Ukf)hadvol  =  Y.jn^hadv°l- 


This  vanishes  for  all  /;  if  and  only  if  afl  =  0  for  all  a  =  1, . . .  ,n.  □ 

The  one-form  a  -  }_  aadya  along  /  is  called  the  Euler- Lagrange  one-form.  It 
generalizes  the  one-form  which  we  encountered  in  Lagrangian  mechanics.  We  will 
also  consider  the  Euler-Lagrange  operators 

£fl:=E(-D)/A>  a=l,...,n,  (5.3) 

|/|<*  aui 

which  are  differential  operators  acting  on  smooth  functions  on  the  manifold 
Jet^(f2, IRm),  such  as  Jzf.  The  operator  Ea  is  related  to  the  function  a„  €  C°°(Q) 
by 

Ea(J?)(jkf)  =  CCa. 

It  is  clear  that  the  Lagrangian  of  a  classical  field  theory  determines  the  equations 
of  motion  and,  hence,  the  solutions  of  the  theory.  Adding  a  constant  to  the  Lagrangian 
does  not  change  the  equations  of  motion.  More  generally,  we  will  show  that  adding  a 
total  divergence  does  not  alter  the  equations  of  motion. 

Definition  5.4.  A  total  divergence  is  a  function  f  on  JetA(f2,  IRm)  of  the  form 


f  =  Div f  :=  E  DiP', 
i=  1 


where  P  =  (P1, . . . ,  Pn)  is  a  smooth  vector-valued  function  on  Jet*  ({2,  Rm)  for  some 
£>k. 

Proposition  5.5.  Let  ££  =  Div  P  be  a  total  divergence.  Then  the  Euler-Lagrange 
operators  Ea  vanish  on  : 


£fl(jS?)=0. 
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Proof.  Let  /,  h  :  PI  — >  Rm  be  smooth  maps  and  assume  that  h  has  compact  support  in 
Q.  Let  Q'  be  a  domain  with  smooth  boundary,  such  that  Q1  contains  the  support  of 
h  and  is  relatively  compact  in  Cl.  Then,  as  in  the  proof  of  Theorem  5.2,  we  have  that 


Y, JnEa (-^  ukf))ti‘dv°l 


£  [  Ea(J?(jkf))h“dvol 
J  n' 

L/d,  )  (//)"'■»' 

r  7) 


d 

de 


e=0 


[  PP  (jk  (f  +  eh))dvol . 
Jn1 


By  the  divergence  theorem, 


f  PP(jk(f  +  eh))dvol 
Jn1 


Di  vP(j((f  +  eh))dvol 


[  ( P(jCf),v)dvoldnf , 

Jdn1 


where  v  denotes  the  outer  unit  normal  and  dvoljQi  the  induced  volume  form  of 
dO.’  C  R".  Since  the  result  does  not  depend  on  e,  we  conclude  that  Ea(Pf  (jk /)))  =  0 
for  all  smooth  maps  / :  Q.  — >  Rm.  This  proves  that  Ea(Pf)  =0.  □ 

Theorem  5.6.  Let  2z?i,2z?2  £  C°°(Jeti:(R”,Rm))  be  two  Lagrangians  of  order  <  k 
defined  on  R".  Then  ££\  and  have  the  same  Euler-Lagrange  equations  if  and  only 
if  P£\  —  Jz?2  is  a  total  divergence. 

Proof.  By  the  previous  proposition,  we  already  know  that  P£\  and  have  the  same 
Euler-Lagrange  equations  if  P£\  —  P/p  is  a  total  divergence.  Therefore  it  suffices  to 
show  that  a  Lagrangian  2z?  £  C°°(Jeti:(R",  R"1))  such  that  Ea(PP)  =  0  for  all  a  (that  is 
a  Lagrangian  with  trivial  equations  of  motion)  is  necessarily  a  total  divergence.  For 
every  /  £  C°°(R”,  Rm)  we  have 


&{?f)  =  [  ^P?{ejkf)de-P?{s0),  (5.4) 

where  so  stands  for  the  zero  section  of  the  vector  bundle  Jeti(Rn,  Rm)  — »  R”,  that  is 
the  £-th  order  jet  of  the  constant  map  R"  — ►  R"1,  jr  i — >•  0.  Clearly  the  function  Pf(s o)  £ 
C°°(R")  does  not  depend  on  /  and  can  be  written  in  the  form  Pf(so(x))  =  d\ F\  (x) 
for  some  function  F\  £  C°°(R").  This  shows  that  it  is  a  total  divergence  Y^DjFj,  since 
we  can  simply  put  F{  =  0  for  2  <  i  <  n.  It  remains  to  show  that  the  integral  on  the 
right-hand  side  of  (5.4)  is  also  a  total  divergence.  By  the  chain  rule  we  have 

CM 
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Performing  partial  integrations  for  each  term  I=\I\<k  brings  this  to  the  form 

r) 

Lfa(~D)^(£Jkf)  +  (DivPe)U2kf),  (5-5) 

Ollj 

where  Pe  =  (Pj  .. . .  .P")  is  a  vector- valued  function  on  Jet2*  1  ( R" ,  Rm )  depending 
smoothly  on  all  the  variables,  including  the  parameter  e.  (Notice  that  DivPf  is 
thus  a  function  on  Jet^flR”,  Rm)  depending  smoothly  on  all  the  variables,  including 
the  parameter  e.)  The  first  term  in  (5.5)  vanishes  by  the  assumption  Ea(Jf)  = 
=°.Thus, 

j-£^{ejkf)de  =  ^Div  fQ  Ped^j  ( j2kf ) 

is  a  total  divergence.  □ 


5.2  Automorphisms  and  conservation  laws 

We  begin  by  observing  that  for  every  pair  of  smooth  manifolds  M,  N  the  group 
Diff(M)  x  Diff(lV)  acts  naturally  on  CX‘(M ,N).  In  fact,  given  a  group  element  g  = 
(<p,  y/)  and  a  smooth  map  f  :  M  N  we  have 

g-f  =  V°f°(p~l. 

This  action  induces  an  action  on  the  jet  bundle  ietk(M,N): 

g-Ukf)'-=  jkv(x)(g-f),  x£M. 

Definition  5.7.  Let  M,N  be  smooth  manifolds,  dvol  a  volume  form  on  M,  n  = 
dimM  and  Jz?  £  C°°(Jetk(M,N))  a  Lagrangian.  An  element g  =  ((p,  l//j  £  Diff+(M)  x 
Diff(iV)  is  called  an  automorphism  of  the  Lagrangian  n-form  Jfdvol  if  for  all 
f  £  x  £  M: 

■&{ g-jkxf){<P*dvol)x  =  f£(jkxf)dvolx .  (5.6) 

Here  Diff+(M)  G  Diff(M)  denotes  the  subgroup  of  orientation  preserving  diffeo- 
morphisms  ofM. 

Pulling  back  the  n-forms  in  equation  (5.6)  by  (p  1  we  obtain  the  equivalent 
equation 

■  jkv-Hx)f)dvolx  =  J?{jk^(x)m<p-lTdvol)x.  (5.7) 

Proposition  5.8.  If  g  £  Diff+(M)  x  Diff(/V)  is  an  automorphism  of  Jzf dvol,  then 

%■/]=/  ■&{jk{g-f))dvol  =  S\f\=(  -S?( jkf)dvol 

JM  JM 
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for  all  f  £  C°°(M,N). 
Proof.  This  follows  from 


\rXT  (& (ff)dvol ) 


(5.6) 


=  J?U;-l{x)Mcp-lYdvol)x  W  J?(g.j*9_1[x)f)dvolx 


=  ■Z'Uxis  ■  f))dvolx. 


□ 

The  calculation  in  this  proof  shows  that  g  =  ( (p ,  y/)  £  Diff+(M)  x  Diff(A')  is  an 
automorphism  of  JZ’dvol  if  and  only  if  for  all  /  £  C°°(M,N ): 

(<p-1)*  (ft f(jkf)dvol )  =  fif(jk(g-f))dvol, 

or,  equivalently, 

< P *  f))dvol}  =  Jf(jkf)dvol. 

Definition  5.9.  A  conservation  law  for  a  Lagrangian  f£  £  Cr°°(Jet*([R”,  Rm))  is 
a  total  divergence  DivP  which  vanishes  on  all  solutions  of  the  Euler-Lagrange 
equations  of  J£. 

Theorem  5.10  (Noether).  Consider  a  classical  field  theory  defined  by  a  Lagrangian 
Jz?  £  C°°(Jet*'  ([R”,  Rm))  and  denote  by  dvol  the  standard  volume  form  o/R".  With 
every  local  one-parameter  group  of  local  automorphisms  of  dvol  one  can  associate 
a  conservation  law  of  the  form  DivP  =  Y^Qa  Ea{T£),  where  Qa  =  Ya  —  Y,n‘-Xl  is 
determined  by  the  vector  field  Yf'=  \  X‘  ^  +  H'a=  i  generating  the  local  one- 

parameter  group. 

For  the  proof  of  Noether’s  theorem  we  will  use  a  series  of  lemmas.  Let  X,Y  be 
smooth  vector  fields  on  R"  and  Rm,  respectively.  Then  the  flow  of  Z  =  X  +  Y  on 
R"  x  Rm  induces  a  flow  on  Jet^R",  Rm)  and  we  denote  the  corresponding  vector  field 
on  Jet*(R",  Rm)  by  pr®Z. 

Definition  5.11.  The  vector  field  pr'^'Z  is  called  the  k-th  prolongation  ofZ. 

Lemma  5.12.  Consider  a  Lagrangian  Jz?  £  C°°(JetA(R”, Rm))  and  the  standard 
volume  form  dvol  on  R".  Let  X,Y  be  smooth  vector  fields  on  R"  and  Rm,  respectively. 
Then  Z  =  X  +  Y  is  an  infinitesimal  automorphism  of  dvol  if  and  only  if 

(prwZ)(Jz?)  +  JzfdivX  =  0.  (5.8) 

Proof.  Recall  that  the  divergence  divX  of  a  smooth  vector  field  X  on  R"  is  character¬ 
ized  by  the  equation 

Lxdvol  =  di  vXdvol, 

where  Lx  denotes  the  Lie  derivative.  The  differential  equation  characterizing  infin¬ 
itesimal  automorphisms  is  obtained  by  differentiating  the  equation  (5.6)  with  respect 
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to  t  after  evaluation  on  a  one-parameter  group  gt  =  (( p, ,  \j/t)  G  Diff+(R")  x  Diff(R'”). 
The  derivative  of  the  left-hand  side  is 

■^igt  ■  jkxf)(<P?dvol)x  =  {prWz)(£’)\jkfdvolx  +  3?(jxf)(divX)xdvolx, 

t= o 

whereas  the  right-hand  side  does  not  depend  on  t.  □ 

Lemma  5.13.  Let  X,Y  be  smooth  vector  fields  on  R"  and  Rm,  respectively.  Then 
the  k-tli  prolongation  ofZ  =  X  +  Y  is  given  by 

Pr  {k)Z  =  Z+  £  LY?L  (5.9) 

l<|/|<t  a  °UJ 

where 

Yja  =  DjQa  +  £  Qa  =  Ya  -  £  u“X‘. 

Proof.  Since  X  and  Y  commute,  the  flow  y>rz  of  Z  =  X  +  Y  decomposes  as  the 
composition  (pf  =  <pf  o  cpj  of  the  flows  of  its  summands.  This  implies  that  (pf  = 
(pf  o  yyF,  where  <pz ,  <p,F ,(pf  denote  the  induced  flows  on  Jet*(R",  Rm).  Differentiation 
with  respect  to  1  yields  that 

pr{k)Z  =  pr{k)X  +  pr{k)Y. 


d 

dt 


Therefore,  it  suffices  to  check  the  formula  (5.9)  in  the  special  cases  X  =  0  and  Y  =  0. 

We  first  consider  the  case  X  =  0.  Then  Qa  =  Y"  and  Yf  =  DjYa.  So,  what  we 
have  to  show  is 

pr«F  =  F+  £  £DyFfl^.  (5.10) 

l<|J|<Jt  a  dUJ 

Let  us  denote  by  y/f  the  flow  of  F  on  R'"  and  by  gt  =  (Id,  y/,)  the  corresponding  local 
one -parameter  group  of  local  diffeomorphisms  of  R"  x  R'”.  The  induced  action  on 
Jet^R",  Rm)  is  given  by 

gt  ■  jxf  =  jkx(Vt  °  f)  =  (x,V,(f{x)),{dj(y?  of)(x))}  i<|jr|<jfc>a=i>...>OT) 

for  all  /  £  C°°(R",  Rm)  and  x  G  R",  where  y/f  =  ya  o  y/,.  Differentiating  this  equation 
with  respect  to  t  yields 

Pr^Uy  =  (0,F/w,{a/(Fao/)(x)}1<w<ifeia=li...jm) 

=  (O ,  Fy (jc) ,  { (DjYa )(jxf)}l<\J\<k,a=l,... ,m )  ■ 

This  is  precisely  the  right-hand  side  of  (5.10). 

Next  we  assume  F  =  0,  in  which  case  Qa  =  —  Y^ufX1.  Firstly,  we  consider  the 
case  k  =  1  as  a  warm  up.  Now 
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Yf  =  DjQa  +  £  uljX‘  =  -  £  ifdjX1. 

So,  what  we  have  to  show  is 

pr  Wx=X-'£^djXi^-5.  (5.11) 

dUj 

Let  us  denote  by  (pt  the  flow  of  X  and  by  gt  =  (<pf,Id)  the  corresponding  local 
one-parameter  group  of  local  diffeomorphisms  of  IR"  x  Rm.  It  acts  on  Jet1  (IR”,  Rm)  by 

8t  ■  jxf  =  j\  (a)  (/  °  <Pf“ ' )  =  (<pt  0)  ,/(*)>{  dj  {f  Oft1)  I  (ft  (a)  }j,a  )  • 


Differentiation  with  respect  to  t  yields 


t=o 


8.fj=[x„  0,1  | 


r=0 


dj(fa°<P, 


We  compute 


d{f°<Pt  ')U(a )=dfod(<P,  X)U(a)  =  [Or  lYdf]%(x) 


In  order  to  differentiate  this  with  respect  to  t ,  we  consider  the  time-dependent  matrix¬ 
valued  function  Ft(x)  :=  F(t,x)  :=  ((pt~l)*df\x.  So,  what  we  have  to  differentiate  is 
F(t.  <p,(x)).  Its  total  time-derivative  is 


F{t,<Pt(x))  =  -j-  (0,x)  +  dF0Xx  =  -Lxdf\x  +  (Hess/) (Xx,  •) 
t=o  "t 

=  -d(dfX)\x+{Hzssf)(Xx,-)  =  -dfodX\x. 


This  shows  that 


d 

dt 


t= 0 


(djifoep-Xxx)) 


-dfa  o  dX  | xdj  =  -  £  difdjX1  (x) 

-CLWjx  o  ai/), 


proving  (5. 1 1)  in  the  case  £  =  1 . 

Let  us  finally  consider  the  case  of  general  k  >  1.  We  first  claim  that  for  every 
multi-index  of  length  <k  we  have 

{DjQa){jk+lf)  =  ~djX(fa).  (5.12) 


In  fact, 

efl(/7)  =  -£«*')(//)  =  =  -*cn 

implies  (5.12).  As  a  consequence, 

7a(/+V)  =  (0/2“  +  E«l^0  (/+7)  =  +I(^/a)x!'. 


54 


5  Classical  field  theory 


So,  what  we  have  to  show  is 


d 

dt 


{dj(fa°(P, 
t= o 


(5.13) 


To  calculate  the  left-hand  side  we  put  Ft(x)  :=  F(t,x)  :=  dj(fa  o  (pt  )x.  Then  we  see 
that 


d 

dt 


t= 0 


d 

dt 


F{t,<Pt{x)) 
t= o 


dF 

dt 


(0  ,x)+dF0Xx 


-djX(fa)\x  +  d(djfa)X\x. 


This  coincides  with  the  right-hand  side  of  (5. 13).  □ 

Now  we  can  prove  Noether’s  theorem. 

Proof,  (of  Theorem  5.10)  We  have  to  show  that  'lQ" Fa(,rX')  is  a  total  divergence. 
By  partial  integration  we  have 

d  9’  d  9 

Y.VU2) = E  +Div1' 

for  some  vector  valued  function  V  =  (V1, . . .  ,V”)  on  Jet2*-1  (IR",  1R"1).  It  suffices  to 
show  that  Y.{DjQ11)^  is  a  total  divergence.  Recall  that  we  defined  Yf  =  DjQa  + 

itj  for  1  <  |/|  <  k.  The  formula  holds  also  for  |/|  =0: 

Ya  =  Qa  +YJu"Xi  ■ 


Using  this,  Lemma  5.13,  and  Lemma  5. 12  we  have 


Y,(DjQ“) 


dF£ 

duaj 


IW-I  =  (-^+Pr 

-  YtXiDiSf  +  (pr {k)Z)^  =  -  YXDi^  -  -^divX 

-Yjd^x1), 


which  is  a  total  divergence.  □ 

In  the  proof  of  Theorem  5.10  we  have  shown  that  Y^Qa  Ea(T£)  is  the  total  di¬ 
vergence  of  the  vector-valued  function  with  components  Pl  =V‘  —  JfX1,  where 
X  =  Y,Xldj  is  the  projection  of  the  infinitesimal  automorphism  Z  onto  the  source 
manifold  IR"  and  V'  is  a  function  on  Jet2*- 1  (IR” ,  IR"')  obtained  by  partial  integration. 
This  means  that  P  =  (Pl, . . . .  P" )  can  be  explicitly  computed  by  performing  the 
partial  integrations.  In  the  case  k  =  1  we  obtain  the  following  result. 

Corollary  5.14.  Let  ££  £  C°°(Jet1(IR",IRm))  be  a  first  order  Lagrangian.  Then 
Noether’s  conservation  law  associated  with  an  infinitesimal  automorphism  Z  = 


5.2  Automorphisms  and  conservation  laws 
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X  +  Y  as  in  Theorem  5.10  is  the  total  divergence  of  the  vector-valued  function 
P  =  (P1, . . . .  Pn)  on  Jet1  ((R" ,  (Rm)  given  by 

P'  =  -YQc'^ - Jz?X!,  Qa  =  Ya-YuaiXJ. 

du“  1 

Proof  It  suffices  to  observe  that  V 1  :=  —  satisfies 


-E<2°A 


di? 


L(a Qa) 


d .Sf 

du“ 


+  Y°‘V‘- 


□ 


As  another  corollary  we  obtain  the  generalization  to  time-dependent  Lagrangian 
mechanical  systems  on  Rm  of  Noether’s  theorem  (Theorem  2.12),  which  concerned 
time-independent  mechanical  systems. 

Corollary  5.15.  Let  2z?  =  fiC(t,q1,...  ,q'n,ql , . . . , qm )  be  a  time-dependent  Lag¬ 
rangian  mechanical  system  on  Rm.  Then  every  infinitesimal  automorphism  Z  =  X  +  Y, 
X  =  Xldt  £  £(R),  Y  =  £  Yadqa  £  X(Rm),  of  Jfdt  gives  rise  to  an  integral  of  motion 

r)  5^ 

f=-pl=Y.Qa~^+^’x^  Qa=Ya~Yrxl- 


Example  5.16.  Notice  that  in  the  case  X  =  0,  an  infinitesimal  automorphism  Y  £ 
£(Rm)  of  Hfidt  is  the  same  as  an  infinitesimal  automorphism  of  2z?  and  the  integral 
of  motion  takes  the  familiar  form  of  Theorem  2.12: 


f  =  YYa^—=df£Y'1 
J  ^  dqa 


The  prolongation  of  a  vector  field  Y  £  £(Rm)  given  in  (5.10)  simplifies  in  the  con¬ 
sidered  case  k  =  1  =n  as 


pr(1)T 


Y 


YjD>Ya 


d 

dqa 


and  thus  Y  is  an  infinitesimal  automorphism  if  and  only  if 


Y(fif)+YDtYa 


dT£ 

dqa 


=  0. 


Evaluating  this  along  the  velocity  vector  field  /  of  a  smooth  curve  y :  I  — >  Rm,  we 
obtain 


YY%t) 


d#(Y(t)) 

dqa 


where  Ya(t )  :=  Y“(y(t)).  So  Y  is  an  infinitesimal  automorphism  if  and  only  if  the 
latter  equation  holds  for  all  y. 
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Next  we  state  Noether’s  theorem  for  time-dependent  Lagrangian  mechanical 
systems  in  its  general  form  replacing  lRm  by  an  arbitrary  smooth  manifold. 

Theorem  5.17.  Let  Jz?  e  C°°( IR  x  T M)  be  a  time-dependent  Lagrangian  mechanical 
system  on  a  smooth  manifold  M.  Then  every  infinitesimal  automorphism  Z  =  X  +  Y, 
X  =X1dt  £  £(R),  Y  £  X(M),  ofXfdt  gives  rise  to  an  integral  of  motion 

f  =  Ywer(&) +  (£’-$  {&))x\  (5.14) 

where  E,  £  X(TM)  denotes  as  usual  the  vertical  vector  field  generated  by  scalar 
multiplication  (s,v)  |— >  esv  in  the  fibers  ofTM  and  Fver  denotes  the  vertical  lift  ofY. 

Proof  Let  (U,  (p)  be  a  local  chart  of  M.  According  to  Corollary  5.15,  we  have  the 
following  integral  of  motion  on  TU: 

f(u,<P)=IJQaW+^x\  <?  =  Ya-E?X1, 

where  Ya,  a  =  1, . . .  ,m,  m  =  dimM,  are  the  components  of  Y  with  respect  to  the 
local  chart  (ql, . . .  ,qm)  =  (p  and  (t,q1,...,qm,ql,...,qm)  are  the  corresponding 
local  coordinates  of  IR  x  TM  =  Jet'([R,M),  defined  on  IR  x  TU  =  Jet1  (IR, t/).  Since 
£ Ya  fla  =  YveT\v  and  f^qa  =  E,\u,  we  see  that  f^u.y)  is  the  restriction  of  the 
globally  defined  (and  manifestly1  coordinate  independent)  function  /.  This  shows 
that  /  is  an  integral  of  motion.  □ 

Example  5.18.  Consider  the  special  case  when  Y  =  0.  The  expression  for  the  pro¬ 
longation  ofX  =Xldt  £  IR  on  Jet1  (IR, Af )  in  local  coordinates  (t,q,q)  follows  imme¬ 
diately  from  (5.11 ): 

pr(l)X  =X-£qadlXl^a.  (5.15) 

The  vector  field  pr'hf  is  in  fact  coordinate  independent  and  obviously  coincides 
with  X  —  dtX1^.  We  conclude  that  X  is  an  infinitesimal  automorphism  of  JZdt  if  and 
only  if 


0  =  X(£')-d,Xl  £,(£’)  +  LZdtX 1  =  A  (if )  +  ( jgf  —  %  (££ )  )d,  X 1 . 

The  right-hand  side  vanishes,  in  particular,  if  JZ  is  time-independent  and  X  =  dt. 
The  corresponding  integral  of  motion  (5.14)  is  precisely 

the  energy,  up  to  the  factor  —1.  So  we  see  that  the  energy  is  the  integral  of  motion 
associated  with  the  invariance  of  the  Lagrangian  under  translations  in  time. 

The  notion  of  an  infinitesimal  automorphism  of  a  Lagrangian  n-form  can  be 
generalized  as  follows. 

1  Note  that  in  the  definition  of  /,  we  did  not  use  any  coordinates. 


5.3  Why  are  conservation  laws  called  conservation  laws? 
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Definition  5.19.  Let  X  and  Y  be  smooth  vector  fields  on  IR"  and  IR'",  respectively, 
££  £  C°°(JetA(R",  R"1))  a  Lagrangian  and  dvol  the  standard  volume  form  o/R".  We 
say  that  Z  =  X  +  Y  is  an  infinitesimal  automorphism  of  Jz? dvol  up  to  a  divergence  if 

(prWZ)(^f) +«5fdivX 


is  a  total  divergence. 

Noether’s  theorem  can  be  easily  generalized  as  follows  (see  Appendix  A,  Exer¬ 
cise  40). 

Theorem  5.20  (Noether).  Under  the  above  assumptions,  let  Z  =  X  +  Y  be  an  infin¬ 
itesimal  automorphism  ofJZ’dvol  up  to  a  divergence.  Then 

£  QaEa  ( Jgf) ,  defined  by  Qa  =Ya  -  £  ufX* , 

is  a  total  divergence,  where  X'  and  Ya  are  the  components  ofX  £  X(R”)  and  Y  £ 

£(Rm). 


5.3  Why  are  conservation  laws  called  conservation  laws? 

Consider  a  classical  field  theory  defined  by  a  Lagrangian  n-form  JZ’dvol,  where 
££  £  C°°(JetA'(R",  IR"'))  and  dvol  is  the  standard  volume  form  on  IR".  We  denote 
the  standard  coordinates  of  IR"  by  (x[) ...  ,xn  1 )  and  think  of  t  =  x°  as  the  time- 
coordinate  and  of  x  =  (x1, . . .  ,x"_1)  as  the  spatial  position.  Suppose  that  we  are 
given  a  conservation  law  DivP,  P  £  C°°(Jet^(R",  R'"),  R"),  l>k.  We  denote  by 

j(t,x)-.=p(fxf), 

the  evaluation  of  P  on  a  solution  /  £  C°°(R",  Rm). 

Definition  5.21.  The  vector-valued  function 


J  =  (/°,  J) :  R"  ->  R"  =  R  x  R"_1 

is  called  the  current,  the  function  J°  is  called  the  charge  density,  J  is  called  the  flux 
density  and  the  spatial  integral 

Q{t):=  [  J° (t ,x)dxl  /\  ■  ■  ■  A  dxn~ 1  (5.16) 

JR"-1 


is  called  the  charge. 

IfP  is  associated  with  an  infinitesimal  automorphism  Z  of  JZ1 dvol,  as  described 
in  Theorem  5.10  (or,  more  generally,  in  Theorem  5.20),  then  J  is  called  the  Noether 
current  and  Q  is  called  the  Noether  charge  associated  with  —Z. 

The  sign  is  included  here  such  that  the  notation  of  Theorem  5.10  is  consistent 
with  the  usual  conventions  for  the  Noether  charge  in  the  physics  literature.  With  this 
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notation,  the  Noether  current  of  an  infinitesimal  automorphism  Z  =  X  +  Y  of  a  first 
order  Lagrangian  Jfdvol  on  IR"  is  given  by 


=  I  Yaof-£djfXJ 


(5.17) 


see  Corollary  5.14.  Note  that  in  the  case  n  =  1,  the  current  reduces  to  the  charge 
density,  J  =  J°  =  Q,  and  as  we  already  observed,  the  equation  DivT*  =  0  reduces  to 
the  statement  that  P  is  an  integral  of  motion,  that  is  to  Q'(t)  =  0  (for  all  solutions  /). 
This  is  generalized  in  the  next  theorem. 


Theorem  5.22.  Suppose  that  the  current  falls  off  sufficiently  fast  at  infinity,  in  the 
sense  that  for  every  bounded  interval  I  there  exists  a  Lebesgue  integrable  function 
F  :  IR'1"1  — >  [0,°°]  such  that 


iAm)| 

dj° 


dt 


(t,x) 


<  F (x), 

<  F (x), 


for  all  (f,x)  €  /  x  IR"  1  and 


lim  /  (J, v)dvoldBr0)  =  0, 

r^°°JdBr(  0)  1  ' 

where  V  denotes  the  outer  unit  normal  of  the  sphere  dBr(  0)  C  R”"1.  Then  the  charge 
is  conseri’ed,  that  is  t 1 1— >  Q{t)  is  constant. 

Proof  In  virtue  of  the  first  inequality,  we  see  that  the  total  charge  (5.16)  is  finite  by 
Lebesgue’s  theorem.  The  second  inequality  then  shows  that  Q  is  differentiable  and 

r  dT° 

Q'(t)  =  / 

J\ r»-i  dt 

where  we  have  abbreviated  dn~lx  =  dxl  A  •  •  •  A dxn~ 1 .  Since 

div/  =  d,J°  +divR„-i  J  =  0, 


we  can  rewrite  this  as 

Q!(t)  =  —  divR„-iJ(t,x)dn~lx  =  -lim  divR„-i  J(t,x)dn~1x 
Jr"~1  r^°°JBr(  0) 

=  —  lim  /  (J,v)dvoldB  (0), 
r^°°  J  dBr(0) 

and  the  resulting  limit  is  zero  by  the  last  assumption  of  the  theorem.  □ 


5.4  Examples  of  field  theories 
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5.4  Examples  of  field  theories 

Here  we  discuss  some  examples  of  classical  field  theories.  The  examples  play  import¬ 
ant  roles  in  active  research  on  contemporary  theories  of  high-energy  particle  physics, 
such  as  the  standard  model  of  particle  physics  and  string  theory. 


5.4.1  Sigma  models 


Let  (. M,g )  and  (N,h)  be  pseudo-Riemannian  manifolds  of  dimension  m  and  n, 
respectively.  We  assume  that  (N,h)  is  oriented  and  denote  its  volume  form  by  dvolh- 
The  most  natural2  first  order  Lagrangian  n-form  ffdvolh  for  maps  /  £  C°°(N,M)  is 
given  by 

&Ulf)  =  \(df,df),  (5.18) 

where  (•,•)  =  (■■,■)  h.g  is  the  (possibly  indefinite)  fiber-wise  scalar  product  on  T*N  <g» 
f*TM  induced  by  h  and  g.  In  local  coordinates  (x‘ )  in  a  neighborhood  of  x  £  N  and 
(y°)  in  a  neighborhood  of  f(x)  £  M  we  have 


dfx  =  £ 


dfa(x) 

r)x‘ 


dx'  \x  (g> 


_d_ 

dya 


m 


and 

X-1  dfa(x)  d  fb{x) 

(dfx,dfx)  =  YJSab{f{x))h]{x)—^i - 

where  (h1*)  denotes  the  inverse  matrix  of  (hij). 

Theorem  5.23.  The  Euler-Lagrange  equations  for  the  Lagrangian  n-form  Jfdvol/, 
given  by  (5.18)  are  equivalent  to 


T(/):=tr*Vd/  =  0, 


(5.19) 


where  V  is  the  connection  in  the  vector  bundle  T*N (g)  f*TM  induced  by  the  Levi- 
Civita  connections  in  TN  and  TM. 


Proof.  See  Appendix  A,  Exercise  39.  □ 

Definition  5.24.  A  smooth  map  f  :  /V  — >  M  which  is  a  solution  ofeq.  (5.19)  is  called 
a  harmonic  map.  The  vector  field  T  (/)  along  f  is  called  the  tension  of  f. 

Example  5.25.  (Harmonic  functions)  Consider  the  case  when  the  target  manifold 
(M,g)  is  simply  the  Euclidean  line  [R.  Then  f*TM  can  be  identified  with  the  trivial 
line  bundle  over  N  and 


2  This  is  to  be  understood  in  the  sense  that  it  generalizes  the  kinetic  term  in  the  standard  Lag¬ 
rangian  (2.4)  of  classical  mechanics. 
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z(f)  =4/:=divgrad/ 

is  given  by  the  Laplace  operator  A  associated  with  the  pseudo-Riemannian  metric  h 
on  the  source  manifold  N.  The  solutions  of  the  equation  Af  =  0  are  called  harmonic 
functions.  Recall  that 

grad  /  =  hTxdf 


and 


divX  =  trVX 


for  every  vector  field  X  on  (N,h).  So  indeed 


div grad  /  =  trV(/z  ]df)  =  tr hVdf  =  X (/). 


Here  we  have  used  that  Vy(h~ldf)  =  /z~'V \,df  for  every  v  £  TN.  Note  that  the 
pseudo-Riemannian  Laplace  equation  Af  =  0  is  linear,  whereas  the  harmonic  map 
equation  (5.19)  is  in  general  non-linear,  since  the  connection  V  depends  on  f. 
Observe  that  in  the  special  case  when  (N,  h)  is  a  pseudo-Euclidean  vector  space  we 
can  write  the  metric  as  h  =  Y,£i(dx')2,  where  £,■  £  {±1},  and  Af  =  ££,<92. 

Example  5.26.  Generalizing  the  previous  example,  we  consider  the  case  when  the 
target  manifold  ( M,g )  is  a  pseudo-Euclidean  vector  space.  Then  the  components 
X “(/)  of  the  tension  X  (/)  with  respect  to  an  affine  coordinate  system  on  the  target 
manifold  M  are  given  by 

x  a(f)=Af. 

So  a  smooth  map  f  from  a  pseudo-Riemannian  manifold  to  a  pseudo-Euclidean 
vector  space  is  harmonic  if  and  only  if  its  components  fa  are  harmonic  functions. 

In  the  physics  literature  the  Lagrangian  (5.18)  is  called  a  sigma-model  and  in  the 
case  of  Example  5.26  it  is  called  a  linear  sigma  model ,  since  the  equations  of  motion 
are  linear.  The  components  of  the  map  /  :  N  — >  M  are  considered  as  scalar  fields.  We 
can  enlarge  the  class  of  sigma-model  Lagrangians  by  including  a  potential: 

^U1f)  =  \(df,df)-V(f), 


where  V  £  C°°(M). 

Example  5.27.  (Geodesics  as  harmonic  maps)  Consider  the  case  when  the  source 
manifold  ( N,h )  is  the  Euclidean  line  IR.  Then  a  harmonic  map  /  :  IR  — >•  ( M,g )  is  the 
same  as  a  geodesic. 

Proposition  5.28.  The  Lie  group  Isom+  (N ,  h)  x  Isom(M,  g)  consists  of  automorph¬ 
isms  of  the  sigma-model  Lagrangian  n-form  Jfdvol^  defined  by  (5.18). 

Proof.  This  follows  from  the  fact  that  the  group  Lorn  1  (N ,h)  of  orientation  pre¬ 
serving  isometries  of  (Nfi)  preserves  the  metric  volume  form  dvof,  and  that 
Isom(lV,/2)  x  Isom(M,g)  preserves  the  Lagrangian  Jzf.  □ 


5.4  Examples  of  field  theories 
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5.4.2  Pure  Yang-Mills  theory 

Let  £  be  a  real  or  complex  vector  bundle  of  rank  k  over  an  oriented  pseudo- 
Riemannian  manifold  (. M,g )  with  a  reduction  of  the  structure  group  of  £  to  some 
compact3  subgroup  G  C  GL(£,  K),  where  IK  =  [R  or  C.  Such  a  G-reduction  in  the 
vector  bundle  E  is  defined  as  a  principal  G-subbundle  ^q(E)  C  J^(£)  of  the  bundle 
of  frames  of  E.  The  elements  of  J^g(E)  are  called  G-frames. 

Example  5.29.  An  0(k) -reduction  in  a  real  vector  bundle  E  is  equivalent  to  a 
(positive  definite  fiber)  metric  h  in  E.  The  corresponding  principal  0(k)- subbundle 
of  the  frame  bundle  E )  is  the  bundle  of  orthonormal  frames  in  the  metric  vector 
bundle  ( E,h ).  A  \J(k)-reduction  in  a  complex  vector  bundle  E  is  equivalent  to  a 
Hermitian  metric  h  in  E.  The  corresponding  principal  \J(k)-subbundle  of  the  frame 
bundle  E )  is  the  bundle  of  unitary  frames  in  the  Hermitian  vector  bundle  (£,/t). 

Note  that  since  G  is  compact  every  G-reduction  induces  a  metric  h  in  E  if  IK  =  IR 
and  a  Hermitian  metric  in  E  if  IK  =  C.  A  connection  V  in  a  vector  bundle  E  endowed 
with  a  G-reduction  q(E )  is  called  a  G-connection  if  the  parallel  transport  with 
respect  to  V  maps  G-frames  to  G-frames.  We  denote  the  curvature  of  V  by 

F  =FV  er(A2T*M®d(E)), 

where  q(Ex)  denotes  the  Lie  algebra  of  the  subgroup  G{EX)  C  GL(£A)  of  elements 
preserving  the  set  of  G-frames  in  Ex,  x  G  M.  For  convenience,  we  will  write  so(E), 
su (E),  01(E)  etc.  rather  than  so(£,  IR)(£),  s u(k)(E),  g[(£,  K)(£)  etc. 

Example  5.30.  An  0{k)-connection  in  a  metric  vector  bundle  ( Eft )  of  rank  k  is  the 
same  as  a  metric  connection.  A  U (k) -connection  in  a  Hermitian  vector  bundle  ( E,h ) 
of  rank  k  is  the  same  as  a  Hermitian  connection. 

Given  a  vector  bundle  E  with  structure  group  G  over  a  pseudo-Riemannian 
manifold  (. M,g ),  the  space  of  fields  of  pure  Yang-Mills  theory  is  the  affine  space 
s/g{E)  of  G-connections  in  E.  By  choosing  a  reference  connection  V°  we  can  write 
V  =  V°  +  <J>,  where  <J> :  M  — >  T* M  <S>  0(E)  is  a  smooth  section.  In  this  way  we  can 
identify  the  affine  space  £#g(E)  with  the  vector  space  T (T'M 0  0(E)).  Using  this 
identification,  one  can  define  jets  of  V  by  considering  jets  of  the  map  <J>  =  V  —  V°. 
The  Yang-Mills  Lagrangian  is 

■^aI(V))  =  -^(E,E),  (5.20) 

where  (•,  •)  is  the  (fiber- wise)  scalar  product  on  A2T*M®q(E)  obtained  as  the  tensor 
product  of  the  scalar  product  (v)A  on  A2T*M  induced  by  g  and  the  scalar  product 
(. A,B)S  =  — tr  (AB)  on  g(£).  Notice  that  (A,A)S  =  —  trA2  =  trAA^  >  0,  where  A ^ 
denotes  the  adjoint  of  A  with  respect  to  the  G-invariant  metric  h.  In  particular,  (•,  -)0 
is  real  valued,  irrespective  of  whether  E  is  a  real  or  a  complex  vector  bundle.  The 


3  Recall  that  a  compact  subgroup  of  a  Lie  group  is  automatically  a  Lie  subgroup. 
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scalar  product  <V>A  is  normalized  such  that  a  A  /3  is  of  unit  length  if  a,  p  are 
orthonormal. 

Recall  that  every  connection  satisfies  the  Bianchi  identity 

dvF  =  0,  (5.21) 

where  dv  is  the  covariant  exterior  derivative  acting  on  differential  forms  with  values 
in  End(£)  D  g(E). 

Theorem  5.31.  The  Euler-Lagrange  equations  for  the  Yang-Mills  Lagrangian 
(5.20)  are  equivalent  to 

dv*F  =  0.  (5.22) 

Before  we  begin  the  proof  of  this  proposition  let  us  first  discuss  an  example. 

Example  5.32.  (Maxwell  theory)  Let  us  consider  the  special  case  when  (M  .g)  = 
(R \dt2  -  Y.a=\(.dxa)2)  is  the  four-dimensional  Minkowski  space  and  E  is  a  Her- 
mitian  line  bundle.  So  the  structure  group  is  the  Abelian  group  G  =  U(  1 ).  Identifying 
the  Lie  algebra  u(l)  =  \J  —  1R  =  R  with  R,  we  can  consider  F  as  a  real-valued  2-form. 
The  Bianchi  and  Yang-Mills  equations  reduce  to  dF  =  d  *  F  =  0.  So  the  differential 
form  F  is  closed  and  co-closed,  which  implies  the  second  order  equations  *d  *  dF  =  0 
andd*d*F  =0.  The  spatial  components  Fap  =  F(daidp),  a, ft  £  {1,2,3},  define 
a  real-valued  time-dependent  2-form  in  R3.  Identifying  R3  =  A2(R3)*  by  means  of 
contraction  of  a  vector  with  the  Euclidean  volume  form,  this  2-form  defines  a  time- 
dependent  vector  field  B  in  R3.  The  remaining  components  Ea  :=  Foa  =  F(dr,da), 
a  =  1,2,3,  define  a  time-dependent  vector  field  E  =  Eada  in  R3.  The  vector 
fields  E  and  B  can  be  interpreted  as  the  electric  and  the  magnetic  field  in  Maxwell’s 
theory  of  electromagnetism  and  the  Yang-Mills  equation  reduces  to  half  of  Maxwell’s 
equations  in  the  vacuum,  that  is 


divE  =  0, 


rotB  = 


whereas  the  Bianchi  identity  reduces  to  the  other  half  of  Maxwell’s  vacuum  equations, 
that  is 


divB  =  0, 


rot  E  =  — 


see  Appendix  A,  Exercise  43. 


In  order  to  derive  the  Yang-Mills  equation  (5.22)  it  is  helpful  to  first  rewrite  the 
Yang-Mills  Lagrangian  as: 


-^(F,F)dvolg  =  -^(FA*F)g,  (5.23) 

where  (•  A-)0  denotes  the  A" T*M- valued  pairing  of  AtT*M®g(E)  with  An~lT*M® 
q(E)  obtained  by  combining  the  wedge  product  and  the  scalar  product  on  g (E ) .  Here 
n  =  dimM  and  £  =  2,  but  in  later  calculations  we  will  also  need  the  case  £=  1.  This 
is  a  consequence  of  the  following  lemma. 
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Lemma  5.33.  For  every  section  a  of  A2T* M  0  g(E)  we  have 
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{a,a)dvolg  =  (a  A*a)g.  (5.24) 

Proof.  Let  (e,),=  i  „  be  a  local  frame  of  TM  and  put  a,-,-  :=  a(e,-,e;).  Then  we  can 
write 

«  =  fLa‘JFF 

where  (el)  is  the  dual  local  frame  of  T  M  and  we  have  abbreviated  e'  A  eJ  =:  e‘f 
Now  we  can  compute 

(a  A*a)g  =  ^-£(a,v,aw) ge,J  A*eke 

=  —  £<a„,  CCkf) g(e  f  e  )Advolg 
=  (a,a)dvolg. 


□ 

Proposition  5.34.  Let  (e,j  be  any  local  frame  in  TM  and  (e‘  )  the  dual  frame  in 
T*M.  Then  the  Yang-Mills  Lagrangian  is  locally  given  by 

where  Fij=F(ei,ej),  Flj  =  £#"' g11' Fy /,  g'j  =  g~l(el,ej). 

Proof.  Choosing  an  orthonormal  frame  (e,).  the  calculation  in  the  proof  of  the 
previous  lemma  shows  that 

-\{F,p)  =  -jE(Fij,F* -i)B(eij,eke)A 

=  ~^^L/(Fij,Fij)  g(e'j  ,e'j)A  =  -\^Fu,F,j)  0  =  ^£tr(^). 

It  is  easy  to  check  that  the  expression  }_  Fjj  F‘J  is  independent  of  the  choice  of 
frame.  This  proves  that  the  formula  —j(F,F)  =  5  Y.lr (FijF'T)  holds  in  every  local 
frame.  □ 

As  a  corollary  we  obtain  the  usual  expression  for  the  Yang-Mills  Lagrangian  in 
the  physics  literature. 

Corollary  5.35.  In  local  coordinates  (x^ ),  /i  =  1 the  Yang-Mills  Lagrangian 
on  ( M,g )  is  given  by 

-{(F,F)  =  \'L*(FgvFn, 

where  Fllv=F(dll,dv),  F^  =  gvv'  fyv,,  g»v  =  g~\dx^  dxv). 
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Lemma  5.36.  Let  V  be  a  G-connection  in  E  and 

aen  l{Q{E))=r(T*M®Q{E)) 
a  one-form  with  values  in  g(F).  Then 

Fv+a  =  FV  +  dva  +  aAa, 

where  (<xAa)(X,Y)  :=  a{X)a{Y )  -  a(Y)a(X)  =  [a(X),a(y)]  for  all  X,Y  G 
X(M). 

Proof  See  Appendix  A,  Exercise  44.  □ 

Corollary  5.37.  Let  V  be  a  G-connection  in  E,  a  G  LI 1  (g(£)),  and  consider  the 
curvature  F£  of  the  one-parameter  family  of  G-connections  Ve  =  V  +  ea.  Then 

{Fe  A *Fg)g  =  2{d  CZA*F)g1 

e=0 

where  F  =  Fq. 

Proof  Since  Fe  =  F  +  edv a  +  if1  a  A  a,  we  have 

( Fe,Fe)dvolg  =  (dv  a,F)dvolg. 

£=0 

□ 


1  d 

2  de 


£— 0 


(F£A*Fe)g-~  — 


d 

de 


Proof  (of  Theorem  5.31)  Using  the  preceding  notation  we  compute 


d 

de 


J*f(j1(ys))dv°lg  =  (dvaA*F)g  =  —  tr(c/va  A  *F) 

e=0 

=  —tr^dv(aA*F)  +  aAdv*F'j 
=  —tvdv(aA*F)  +  (aAdv*F)g. 


Since 


trc/v(aA*U)  =dtr(aA*F), 

we  conclude  that  for  variations  a  with  compact  support  we  have 
f  d 

£—0 


JM 


de 


J?(j\Ve))dvolg  =  f  (a  Adv  *F)gdvolg. 
1  JM 


This  proves  that  the  equations  of  motion  of  pure  Yang-Mills  theory  are  indeed 
dv*F  =  0.  □ 


5.4  Examples  of  field  theories 


65 


Gauge  transformations 


Given  a  vector  bundle  £  — >■  M,  we  denote  by  Aut(£)  its  group  of  automorphisms.  If 
the  vector  bundle  is  equipped  with  a  G-reduction  q(E )  C  J^(£),  then  the  subgroup 
of  Aut(£)  consisting  of  those  automorphisms  which  preserve  the  reduction  will  be 
denoted  by  Autc(£).  It  coincides  with  the  group  of  sections  of  the  group  bundle 
G(E)  C  GL(£). 

Definition  5.38.  Aute(£)  is  called  the  gauge  group  of  the  vector  bundle  E  with 
G-reduction  q(E ).  The  elements  ofAutc(E)  are  called  gauge  transformations. 

Gauge  transformations  play  a  similar  role  in  the  study  of  vector  bundles  to  diffeo- 
morphisms  in  the  study  of  manifolds.  The  group  Aut(£)  acts  C°°(M) -linearly  on  the 
space  of  smooth  sections  F (£)  by 

Aut(£)  x  r(E)  — >  r(E),  (<p,s)  ->  (ps. 

Proposition  5.39.  The  action  of  Autg(E)  on  F(£ )  induces  an  affine  action  on  the 
space  £?g{E)  of  G-connections. 

Proof  The  transformation  of  .sYq(E')  induced  by  (p  £  Aut g(E)  is 

V  i-j-  V'  =  :=  tpo Votp~l.  (5.25) 

To  see  that  it  is  affine,  we  apply  V'  to  a  section  s  £  F(£): 

Vs  =  (p(V(p^1s )  =  cp'V((p~1)s  +  Vs. 


So 


V'  =  V  +  rpVtrjT1) 


(5.26) 


is  related  to  V  by  a  translation. 


□ 


Proposition  5.40.  Let  V  be  a  connection  in  a  vector  bundle  E  — >  M  and  (p  G  Aut(£) 
an  automorphism.  Then  the  curvature  F'  of  the  gauge  transformed  connection 
V'  =  V'P  is  related  to  the  curvature  F  ofV  by  the  natural  C°°  (M)- linear  action  of 
Aut(£)  on  f22(End(£')); 

F'  =  tp  oF  o  (p  1 . 


Proof  This  is  a  straightforward  consequence  of  (5.25)  and  the  definition  of  the 
curvature.  □ 


Corollary  5.41.  The  Yang-Mills  Lagrangian  (5.20)  is  invariant  under  Aut(£). 

Proof  This  follows  from  the  fact  that  the  trace  form  (A,B)  i— ?■  tr(AB)  on  01(F), 
which  was  used  in  the  definition  of  (F,F),  is  invariant  under  conjugation.  □ 
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This  clarifies  the  role  of  gauge  transformations  in  Yang-Mills  theory.  For  that 
reason,  Yang-Mills  theory  is  also  known  as  gauge  theory.  In  particular  in  the  physics 
literature,  often  a  further  distinction  is  made  between  Abelian  and  non- Abelian  gauge 
theories  depending  on  the  property  of  the  respective  gauge  group.  In  this  context,  only 
the  latter  is  customarily  referred  to  as  Yang-Mills  theory.  This  additional  distinction 
is  used  not  only  for  historical  reasons  but  also  because  the  corresponding  physical 
theories  differ  significantly  in  both  cases  [9,  12,  15], 

Remark.  Let  us  denote  by  s/qM(E)  C  ( E )  the  subset  consisting  of  Yang-Mills 
connections,  that  is  solutions  to  the  Yang-Mills  equations.  It  follows  from  Corollary 
5.41  that  the  gauge  group  Aut g(E)  acts  on  s/fM(E).  The  quotient 

^M(E)/AutG(E)  (5.27) 

is  the  set  of  equivalence  classes  of  Yang-Mills  connections.  Notice  that  the  space  of 
connections  as  well  as  the  group  of  gauge  transformations  are  of  infinite  dimension. 
Nevertheless  it  is  possible  to  prove  that  under  suitable  extra  assumptions  (on  the  sig¬ 
nature  of  the  metric  g,  the  connections,  and  gauge  transformations),  quotients  similar 
to  (5.27)  are  sometimes  finite-dimensional  smooth  manifolds  encoding  differential 
topological  information  about  the  smooth  manifold  M.  These  problems  are  studied  in 
mathematical  work  on  gauge  theory.  A  particularly  successful  instance  is  Donaldson 
theory  (see,  for  example,  ref.  [4]),  which  is  concerned  with  four-dimensional  smooth 
manifolds  M,  and  restricts  attention  to  the  class  of  self-dual  or  anti-self-dual  con¬ 
nections  (with  respect  to  a  Riemannian  metric  g),  that  is  connections  satisfying  the 
self-duality  equation 

*F  =  F 


or  the  anti-self-duality  equation 


*F  =  -F. 

In  virtue  of  the  Bianchi  identity,  each  of  these  equations  implies  the  Yang-Mills 
equation: 

dv  *F  =  ±dvF  =  0. 


The  connection  one-form 

Let  V  be  a  connection  in  a  vector  bundle  £->Mof  rank  k.  Given  a  local  frame 
/  =  (/i , . . . ,fk)  of  E.  defined  on  some  open  subset  U  CM,  there  exists  a  system  of 
one-forms  A'-,  i,j  €  such  that 


Vxfi  =  ZAi{X)fj 


(5.28) 


for  all  vector  fields  X  on  U  CM.  The  matrix-valued  one -form 
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A=AV’f=(Aij)iJe{l_k} 

is  called  the  connection  one-form  of  V  with  respect  to  the  local  frame  /.  The  equation 
(5.28)  can  be  written  in  matrix-notation  as 

V/  =  /A. 

Solving  it  with  respect  to  A  we  obtain 

A=rlvf, 

where  /-1  =  (/j*, . . .  ,ff)T  is  the  column  vector  which  is  the  transposed  of  the  dual 
frame  f*  =  (/ *  The  notation  is  consistent  with  the  interpretation  of  a  frame 

b  =  [b\,...,bk)  of  E  atxGM  (such  as  b  =  fx)  as  an  isomorphism 

b  :  IK*  — ►  Ex,  v  — >  bv  =  ^ vlbt, 

where  bv  is  the  product  of  the  Ex-valued  row  vector  b  =  [b\ , . . . ,  b/f)  with  the  column 
vector  v  =  (v1, . . . ,  vA)T.  The  inverse  of  this  map  is  given  by 

Ex->\Kk,  {b\{w),...,b*k{w)y . 

Proposition  5.42.  Let  V  be  a  connection  and  f  a  local  frame  in  a  vector  bundle  E 
over  IK  £  { IR,  C}.  The  connection  form  A'  := Av  ’f  of  a  gauge  transformed  connection 
V'  =  V'P,  cp  £  Aut(E),  is  related  to  the  connection  form  A  =AV^  o/V  by 

A'  =  y/A  y/~ 1  +  yd  ( y/- 1 ) ,  (5.29) 

where  \ff  =  f~lo(pof:U  — >  GL(fc,  K)  is  the  matrix-valued  function  representing 
the  automorphism  (p  in  the  frame  f. 

Proof  Using  that  (p  =  f  o  y/  o  f^1  we  compute 

a:=V'-V(5=}  «pV(9“1)=/vA/-1[(V/)vA-1r1+/V(vA-1r1)] 

=  fyAy-1/- 1  +/V/V(V/-1r ‘)  =  fVAy-'r1  +  fVd(y-l)f-'  +/V(/‘ !) 

and,  hence, 

A'-A=f-la(f)  =  \i/A\i/-1  +  \i/d{\i/-1)+  . 

=-rlvf=-A 


□ 


Proposition  5.43.  With  the  notation  of  the  previous  proposition  we  have 


AV'’/=AV>/', 


where  f  =  <p  1  of  =  fo\f/ 
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Proof.  From  the  formula  ,4V  ^  =  /  1 V/  we  can  immediately  deduce  the  general 
transformation  rule 

AipoVo<p~l ,(pof  =  (5.30) 

Substituting  f  for  /  in  this  formula,  we  obtain  the  claim.  □ 

The  last  proposition  shows  that  a  gauge  transformation  V  i— »  V'  =  VP  =  <p  o 

V  o  <p~*  by  (p  has  the  same  effect  on  the  connection  form  as  a  change  of  frame 
ft- 1  f  =  <p  1  o  y  =  /  o  i//  1  by  the  inverse  transformation  <p  ,  or  equivalently,  by 
the  corresponding  change  of  basis  y/  '  in  IK*- 

Substituting  a  one-parameter  group  y/s :  U  — >  GL(£,  IK)  into  (5.29)  and  differenti¬ 
ating  with  respect  to  s,  we  obtain  the  local  expression  for  the  action  on  a  connection 

V  of  an  infinitesimal  gauge  transformation  represented  by  T  :  U  — >  gl(£,  IK): 

A  i->  [t,A]  -dr. 

To  apply  Noether’s  theorem  (Theorem  5.10)  and  to  compute  the  Noether  current 
(5.17)  corresponding  to  an  infinitesimal  gauge  transformation,  one  can  use  the  local 
trivialization  /  of  the  bundle  E  and  a  system  of  coordinates  (mu),  fi  =  1 
on  U  C  M  to  describe  a  connection  V  by  its  connection  form  A  =  )_  A  ^  dx^  = 
Y.A'j^dx^1  <?/,  which  we  consider  as  a  map 

U  -A  U  x  ((R")*  0  fll (Jfc,  IK)),  X  ^  (x,  (A^(x))). 

Here  (ei , . . .  ,ef)  denotes  the  standard  basis  of  Now  one  can  compute  the  corres¬ 
ponding  Noether  current  by  the  formula 

a  9? 

/v=£y(A)^4a1A), 

a  dli“ 

where  a  is  an  index  numerating  the  elements  of  the  basis  ( dx ^  0  e*  0  ef)  of  V  := 
(R")*  0  g l(k,  IK)  such  that 

K(,-'A))  =  (4(/A))  =  (Aj.M)  and  («»(/A))  =  (^(/A))  =  (^). 
The  components  (T“)  =  ( )  of  the  infinitesimal  automorphism 

£(E7  x  V)  3  T  =  :  F  x  V  3  V, 

(x,A)  i— >7(x,A)  =  [t(x),A]  —  dx(x) 

are  given  by 

(x,  A)  -  52  4  4"  W  -  dH  4  W  >  4  =  (A;>  )  ■ 

In  order  to  obtain  an  explicit  expression  for  Jv  it  suffices  to  calculate  the  partial 
derivatives 
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dJSf 

dUy 


U'A) 


where  ((F^v)‘  )(J-6{ is  the  matrix  representing  the  endomorphism  with 
respect  to  the  local  frame  /.  So  we  obtain: 


Proposition  5.44.  The  Noether  current  associated  with  an  infinitesimal  gauge 
transformation  represented  by  T  :  U  —4  gl(£,  IK)  is  given  by 


=tr^f’A'v([T,Av]  —  dvx). 


5.4.3  The  Einstein-Hilbert  Lagrangian 

In  theories  of  gravity  the  space-time  metric  is  no  longer  fixed  but  is  a  dynamical  field 
of  the  theory,  that  is  subject  to  equations  of  motion.  The  most  important  Lagrangian 
for  a  pseudo-Riemannian  metric  g  on  a  smooth  manifold  M  is  the  Einstein-Hilbert 
Lagrangian  n-form 

■%'(j2g)  =  — scaldvoL,  (5.31) 

where  seal  is  the  scalar  curvature  of  (M,g)  and  K  is  the  gravitational  coupling 
constant.  It  describes  pure  gravity,  that  is,  gravity  in  the  absence  of  matter  fields.  More 
complicated  theories  of  gravity  can  be  obtained  by  including  matter  fields  described, 
for  instance,  by  adding  to  the  Einstein-Hilbert  term  a  sigma-model  Lagrangian  (for  a 
map  from  M  to  some  target  manifold)  or  a  Yang-Mills  Lagrangian  (for  a  connection 
in  some  vector  bundle  E  — >  M).  Notice  that  the  fields  of  the  resulting  theory  are 
coupled  through  the  common  dependence  on  the  space-time  metric,  in  the  sense 
that  the  resulting  equations  of  motion  generally  form  a  coupled  system  of  partial 
differential  equations. 

Theorem  5.45.  The  Euler-Lagrange  equations  for  the  Einstein-Hilbert  Lagrangian 
are  equivalent  to  the  Einstein  vacuum  equation: 

Ric  — -seal  ■  g  =  0,  (5.32) 

which  is  equivalent  to  Ric  =  0  if  n  =  dim  M  >  3  and  is  satisfied  for  every  pseudo- 
Riemannian  metric  g  if  n  <  2. 

Proof  Let  ge  be  a  smooth  family  of  pseudo-Riemannian  metrics  on  M  such  that  the 
symmetric  tensor  field 

h  :=  M  g£  (5.33) 

°£  £— 0 

has  compact  support.  In  order  to  compute  the  derivative  of  ( j2g£ )  with  respect  to 
e,  we  first  observe  that 
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d 

de 


dvolge  = 

e=0 


1 

2 


tr(g  1h)dvolg , 


(5.34) 


where  g  =  go-  The  derivative  of  the  curvature  tensor  Re  of  the  Levi-Civita  connection 
VF  of  the  metric  g£  is  expressed  in  terms  of  the  tensor  field 

Ve=^-  (Ve-V)  gi^EndTM) 

e=0  c'£  £— 0 


by 


d 

de 


Re  =  dva, 

e=0 


V  =  v°, 


as  follows  easily  from  Lemma  5.36.  As  a  consequence,  the  Ricci  curvature  Ric£  of 
g£  can  be  computed  as  follows: 


d_ 

de 


RicAX.Y) 

£— 0 


d_ 

de 


tr (Z  ^  Re(Z,X)Y)  =  tr (Z  i-g  dva{Z1X)Y), 

£— 0 


where  X,Y,Z  g  X(M).  Omitting  Z  we  can  write  this  in  terms  of  a  local  frame  (e,)  of 
TM  as 

Rice(X,Y)  =  [( Veicc)xY  -  (Vxa)e,7)] ,  (5.35) 

£—0 

where  ax  =  oc(X)  denotes  the  endomorphism  field  obtained  by  evaluating  the  one- 
form  a  on  the  vector  field  A.  We  have  used  that  dv a(X,Y)Z  =  (Vx^yZ—  (Vya)xZ 
for  all  X,Y,Z  g  X(M).  For  the  scalar  curvature  scal£  of  g£  we  obtain 


d_ 

de 


d_ 

de 


scal£ 

£— 0 


d_ 

de 


£— 0 


(trg£  lRic£)  =  -ti-(g  lhg  lRic ) 


Riff 


£— 0 


:/ 


(5.36) 


where  Ric  denotes  the  Ricci  curvature  of  g.  If  we  can  show  that  /  is  a  total  divergence, 
then  (5.34)  and  (5.36)  imply  that 


scal£dvolge.  = 

£— 0 


lRic  — 


dvolg. 


From  this  formula  we  see  that  the  equations  of  motion  are  equivalent  to  (5.32).  Taking 
the  trace  of  g~lRic  —  ^Id,  we  see  that  every  solution  has  seal  =  0  if  n  ^  2,  which 
implies  Ric  =  0  by  (5.32).  To  compute  /  at  some  point  p  g  M  we  can  assume  that 
X ,,  e j  =  0  at  p.  Then  from  (5.35)  we  compute  at  p: 

f  =  I>JV  [(veia)ejek-  (veja)eiek)\ 

=  E  e'  Ve,  (gjkaej  ek)  -  E  (^'  °  ) , 
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where  g’k  =  g  l(efek).  The  first  sum  is  the  divergence  of  the  vector  field  tr,  a 
obtained  by  contraction  of  the  first  two  factors  of  T*M®T*M®TM  with  the  help 
of  the  metric.  The  other  sum  is  the  divergence  of  the  vector  field  v  =  g  1  fel  o  an 
obtained  by  contraction  of  the  first  and  last  factors  by  duality  and  identification  of 
the  remaining  factor  T*M  with  TM  using  the  metric: 

Y,8jkek^ej(e‘  oaei)  =  Y,8jkek^ejgv  =  Y,8jk8{^ejV,ek)  =  JVVej.v  =  divv. 

So  we  have  proven  that  /  is  the  divergence  of  the  vector  held  trg  a  —  v,  which  does 
not  depend  on  the  particular  frame.  □ 

Let  (M,g,or)  be  an  oriented  pseudo-Riemannian  manifold  and  denote  by  dvolgor 
its  metric  volume  form  with  respect  to  the  orientation  or.  In  local  coordinates 
(x  , . . . ,x")  on  some  open  set  U  C  M,  we  have 

dvolg0r\u  =£\j\  det(gy)  \dx'  A  •  •  •  A  dx",  ££{1,-1},  (5.37) 

if  dxl  A  •  •  •  A  dx n  £  S  or,  where  gij  =  g{dj,dj). 

Proposition  5.46.  For  all  (p  £  Diff(M),  we  have 

scal(p*gdvolq,*g(p*or  =  (p*  (scalgdvolg}0r). 

In  particular,  the  Einstein-Hilbert  Lagrangian  n-form  scalgdvolg,  where  dvolg  = 
dvolg  0r,  is  invariant  under  all  orientation  preserving  diffeomorphisms  ofM: 

scalqtgdvolytg  =  (p*  (scalgdvolg). 

Proof.  For  all  diffeomorphisms  (p  of  M  we  have 

dvolv*g(p*or  =  (p*  dvolg0r  and  scal^g  =  (p*scalg. 


□ 


The  Einstein-Hilbert  Lagrangian  can  be  generalized  by  considering  instead 

J^(fg)  =  J^(scal  +2  A)dvolg,  (5.38) 

where  A  is  a  constant  known  as  the  cosmological  constant.  The  proof  of  The¬ 
orem  5.45  does  also  show  the  following. 

Theorem  5.47.  The  equations  of  motion  of  the  Einstein-Hilbert  Lagrangian  with 
cosmological  constant  (5.38)  are  equivalent  to 

1 

Ric  —  -seal  ■  g  =  Ag.  (5.39) 

This  equation  is  equivalent  to  the  system 
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2  —  n  seal 

A  = - seal  and  Ric  = - g. 

2n  n  6 


Notice  that  if  n  <  2,  then  necessarily  A  =  0  and  every  metric  is  again  a  solution. 
If  n  >  3,  then  one  can  show  (exercise)  that  if  g  is  an  Einstein  metric,  that  is 


Ric  =  fg 


for  some  function  /,  then  /  =  scal  =  const.  Thus  g  solves  (5.39)  for  A  =  'kff^scal.  In 
other  words,  the  solutions  of  (5.39)  for  n  >  3  are  precisely  the  Einstein  metrics  such 
that  scal  =  jfy,  A .  Observe  that  given  an  Einstein  metric,  the  equation  scal  =  A 
can  be  always  solved  by  rescaling  the  Einstein  metric  by  a  positive  constant  provided 
that  scal  has  the  same  sign  as  A.  So  it  is  sufficient  to  distinguish  only  3  cases:  A  =  0, 
A  >  0  and  A  <  0. 

To  end  this  section  we  explain  now  how  to  obtain  the  Einstein  metrics  as  solutions 
of  a  variational  problem  without  introducing  a  cosmological  constant.  For  that  we 
consider  the  Einstein-Hilbert  Lagrangian  but  restrict  to  variations  ge  with  compact 
support  (that  is  (5.33)  has  compact  support)  that  are  volume  preserving,  that  is 
dvol„.  =  0. 

e=0 

Theorem  5.48.  The  equations  of  motion  for  the  Einstein-Hilbert  Lagrangian  under 
volume  presen’ing  variations  with  compact  support  are  equivalent  to 

Ric0  =  0,  (5.40) 

where  Ric 0  =  Ric —  ppg  denotes  the  trace-free  part  of  Ric.  A  pseudo-Riemannian 
metric  g  is  a  solution  of  (5.40)  if  and  only  if  Ric  =  fgfor  some  function  f. 

Proof.  Owing  to  (5.34),  we  obtain  (5.40)  by  projecting  (5.39)  to  its  trace-free  part. 

□ 


d_ 

de 


5.5  The  energy-momentum  tensor 

In  the  previous  subsection,  we  discussed  pure  gravity,  that  is,  gravity  in  the  absence 
of  matter  fields.  The  key  ingredient  for  coupling  matter  fields  to  Einstein  gravity  is 
the  energy-momentum  tensor  [13,  14,  19].  It  can  be  introduced  by  reconsidering  two 
important  conservation  laws  of  classical  mechanics  in  the  context  of  classical  field 
theory.  Recall  that  the  reason  for  conservation  of  energy  in  classical  mechanics  is 
the  invariance  of  the  Lagrangian  under  time-translations.  Similarly,  invariance  under 
spatial  translations  implies  conservation  of  momentum. 

As  a  concrete  simple  example  to  illustrate  at  least  some  of  the  features  of  matter- 
coupled  Einstein  gravity,  let  us  start  by  considering  a  first  order  field  theory  for 
maps  /  :  ©  — >  ST  from  some  source  manifold  6  to  some  target  manifold  57 .  Since 
our  considerations  will  be  local,  we  may  restrict,  by  a  choice  of  local  coordinates, 
to  ©  =  HR",  7  =  Rm,  and  we  take  the  standard  volume  form  on  R".  Then  we  can 
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consider  translations  X  =  ^  in  the  source  manifold  0  =  R",  where  are 

constants.  The  vector  field  X  is  an  infinitesimal  automorphism  of  a  Lagrangian 

.5?  G  C°° ( Jet 1  ( R",  Rm ) )  if  and  only  if 

#Ulf)  =-%’(x,f{x),df(x)),  df(x)  :=  (<9M/(x)), 

does  not  explicitly  depend  on  x  =  (x^)  G  R”.  This  follows  from  the  fact  that  the 
prolongation  of  X  =  G  £(R")  is  given  by  pr^pf)  =  X  G  XpetpR",  R'”)). 

This  can  be  seen  either  from  Lemma  5.13  or  by  observing  that  the  action  of  a 
translation  by  a  vector  v  =  ( v G  R"  as  a  diffeomorphism  on  Jet1  (Rn,  Rm)  is  simply 

(x,f{x),df(x))  (x  +  v,f{x),df(x)). 

The  Noether  current  (5.17)  corresponding  to  the  infinitesimal  translation  —  v  is  given 
by 

3  </> 

^  £  dv/V  ^  OV)  - 

We  can  write  it  as  ,T‘  =  v  v  Ty  ,  where 

Tv  ■=  dvr—uxf)-^{j\m  (5.4i) 

ouan 

are  the  components  of  the  energy-momentum  tensor  or  stress-energy  tensor.  (Caveat: 
We  will  redefine  the  energy-momentum  tensor  below.)  As  a  consequence  of  Noether’s 
theorem  (Theorem  5.10)  we  have  the  following  result. 

Proposition  5.49.  If  G  C^petpR",  R'"))  is  invariant  under  translations  in  R”, 
then  the  energy-momentum  tensor  is  divergence-free  (when  evaluated  on  solutions  of 
the  equations  of  motions),  that  is 


Example  5.50.  Consider  the  linear  sigma-model 

# 1/7 )  =  \  -  V (/),  /  :  R"  ->  R“, 


(5.42) 


with  potential  V  G  C°°(R'"),  where  greek  indices  are  raised  with  the  inverse  of  the 
constant  metric  h  =  Y,hfxVdx^ dxv ,  for  example  <9^  =  £/;^v<9v  and  (/;^v)  being  the 
matrix  inverse  to  (h^v).  The  corresponding  energy-momentum  tensor,  written  as 
(0,2)-tensor  field,  is  symmetric  and  given  by 

Tpv  =  ^Sub{f)d^fadvfb  —  J?(jl  f)hnv 


In  index-free  notation  the  right-hand  side  reads 

fg-^h. 
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When  h  =  dt2  —  L«=i  (dxa)2  is  the  Minkowski  metric,  then  the  energy  density  (i.e. 
the  charge  density  associated  with  the  infinitesimal  automorphism  —dr) 

T00  =  Tq  =  \g{dtf,dtf )  +  \  £  g(daf,daf)+V(f), 

z  z  a=l 

' - V - " 

resembles  the  energy  in  mechanics  (and  coincides  with  it  ifn  =1),  if  we  consider  V 
as  potential  energy.  The  non-gravitational  energy  is  in  this  situation  defined  as  the 
spatial  integral  of  the  energy  density: 

E(t)  =  [  T°°(t,xl,...,^~l)dn~lx.  (5.43) 

Jr"-1 

It  is  constant  under  appropriate  boundary  conditions  at  spatial  infinity  ( as  in  The¬ 
orem  5.22).  Similarly,  one  defines  the  momentum  density  as  the  time-dependent 
spatial  vector  field 

n— 1 

E  Ta°da 

a=\ 

on  IR”-1  and  the  momentum  as  the  spatial  integral 

Y(t)  =  nypa(t)da,  Pa(t)=(  ro0(r,xV..,jO<r-1x.  (5-44) 

<^1 

Notice  that  the  momentum  density  (7’“0)a=i,...,n-i  is  the  flux  density  associated 
with  the  infinitesimal  automorphism  —dt.  Its  components  coincide  up  to  sign  with 
the  charge  densities  T®  =  —Ta0  associated  with  the  spatial  translations  —da,  CC  = 
1, —  1. 

It  is  important  to  remark  that  the  physical  notions  of  energy  and  momentum  as 
defined  above  are  only  valid  in  the  Newtonian  limit  of  Einstein  gravity,  that  is,  for 
quasi-static  matter  systems  in  asymptotically  flat  space-times  (see  also  the  discussion 
in  [14,  Sect.  19.3]).  In  general,  the  physical  notions  of  energy  and  momentum  become 
rather  subtle  concepts  in  Einstein’s  theory  of  general  relativity.  In  fact,  no  general 
definitions  of  energy  and  momentum  valid  for  arbitrary  space-time  metrics  exist. 
Only  for  special  classes  of  metrics  there  exist  well-defined  expressions  such  as, 
for  example,  ADM  energy,  Komar  energy,  Bondi  energy  and  Hawking  energy.  For 
further  details  regarding  this  subtle  aspect  of  Einstein  gravity,  the  reader  is  referred 
to  [13,  14,  19]. 

Example  5.51.  Consider  the  Yang-Mills  Lagrangian  ,I/J (j 1 A )  =  -j^tr  (F^F^). 
Differentiating  this  Lagrangian  with  respect  to  d^Ap  (more  precisely,  with  respect  to 
u)p  p  in  the  notation  of  Section  5.4.2)  we  obtain  the  (base-point  dependent )  linear 
map 


0  — >  K,  B^tr(BF^p). 
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Therefore  from  (5.41 )  we  obtain  that  the  corresponding  energy-momentum  tensor  is 
given  by 

rvM  (old)  =  £tr  ((dvAp)F^P) 

(We  are  working  in  a  global  trivialization  of  the  vector  bundle,  such  that  the  endo- 
morphisms  FpV  can  be  considered  as  matrices.)  At  least  in  the  case  of  Abelian  gauge 
groups  ( the  reader  may  try  to  extend  this  to  the  case  of  non- Abelian  gauge  groups), 
this  tensor  differs  from  the  tensor 

Ty  (new)  =  £tr(FvpF^)  ~^(jlA)8^  (5.45) 

which  is  symmetric  with  respect  to  the  metric  h,  by  a  tensor  with  vanishing  divergence. 
In  fact,  for  Abelian  gauge  groups  F^v  =  dpAv  —  dvAp  and  it  suffices  to  subtract 

£tr((dpAv)F/‘P)=£3ptr(AvF^) 

from  Ty  (old).  The  last  equation  holds  because  of  the  Yang-Mills  equation,  which 
for  Abelian  gauge  groups  reduces  to  Y,dpFPp  =  0.  The  tensor  (5.45),  which  differs 
from  the  original  energy -momentum  tensor  by  symmetrization  and  is  still  divergence- 
free,  will  from  now  on  be  called  the  energy-momentum  tensor  and  its  components 
will  be  denoted  by  Ty,  rather  than  Ty  (new).  Notice  that  the  newly  defined  energy- 
momentum  tensor  is  not  only  coordinate  independent  (as  the  previously  defined 
energy -momentum  tensor )  but,  contrary  to  the  previously  defined  one,  is  always 
invariant  under  gauge  transformations. 

In  the  presence  of  gravity,  the  space-time  metric  g  (formerly  denoted  h)  is  con¬ 
sidered  as  a  dynamical  field  of  the  theory.  The  definition  of  the  action  functional  (5.1) 
is  then  modified  to  read 


S\f,g]  =  [  L(jk(f),/(g))dvolg  , 
Jm 


where  dvolg  =  yj  det(g)|  dx1  A  •  •  •  A  dxn  in  local  coordinates  (x 1 , . . .  ,x")  on  some 
open  set  U  C  M  (cf.  (5.37)).  For  physical  reasons,  it  is  important  to  distinguish 
between  the  gravitational  Lagrangian4  Lqr  and  the  matter  Lagrangian  LmaUer.  The 
former  depends  only  on  g  and  at  least  in  ordinary  Einstein  gravity  (as  opposed  to 
modified  gravity  theories)  it  is  taken  to  be  the  Einstein-Hilbert  Lagrangian  L\  \\  = 
j -g^scal  (cf.  (5.31)).  The  matter  Lagrangian  not  only  depends  on  g,  but  also  on  other 
fields  through  their  k-th  order  jets.  In  total,  L  =  Lqr  +  LmaUer. 

One  can  check  that  in  the  above  examples  the  (symmetric)  energy-momentum 
tensor  can  be  obtained  from  the  formula 


Tgv 


8.  if 


v/ldetC?)l  5s^ 


matter 

v 


_ rs  & ^matter  , 

—  ^  ggliv  '  £juv^matter  ’ 


(5.46) 


4  We  remark  that  in  some  texts  the  definition  of  the  Lagrangian  in  gravity  theories  differs  by  a  factor 

of  yjdetjg)!,  namely  =  ly/ldetjgjjz..  This  will  be  used  below. 
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where  \=  Egnv(fi?),  and 


Eg "v 


d 

d, 


d 

d[dxg^v) 


±... 


(5.47) 


denotes  the  Euler-Lagrange  operator  associated  with  the  field  g^v  as  defined  in  (5.3). 
Here  we  are  denoting  the  coordinates  on  the  jet  space  (idiv .if  , . . .)  by  the  same 
symbols  as  their  evaluation  (g^v,  dvg^v , . . .)  on  jk(g _1),  as  customary  in  the  literat¬ 
ure.  Eq.  (5.46)  is  also  often  used  as  the  definition  of  the  (matter)  energy-momentum 
tensor  in  the  physics  literature.  With  this  definition,  the  energy-momentum  tensor  is 
always  symmetric. 

Proposition  5.52.  Let  Jz?  =  Jz?eh  +  -^matter  be  a  gravity  theory  described  as  the 
sum  of  the  Einstein-Hilbert  Lagrangian  for  a  pseudo-Riemannian  metric  g  and  a 
Lagrangian  Jz?ma  tter  which  depends  on  g  and  on  other  fields  through  their  k-th  order 
jets.  Then  the  equations  of  motion  corresponding  to  the  variation  of  the  metric  take 
the  form: 

(^icg  -  \scals^j  =  ~T,  (5-48) 

where  T  =  Y,Tfivdx^  dxv  is  the  energy-momentum  tensor  of  tter  defined  in  (5.46). 


Proof.  The  Euler-Lagrange  operator  associated  with  g-u v  is  \  T  for  the  matter  Lag¬ 
rangian  and  {Ricg  —  j scalgg )  for  the  Einstein-Hilbert  term.  For  the  latter  state¬ 
ment  we  are  using  the  calculation  in  the  proof  of  Theorem  5.45  and  taking  into 
account  that  the  variation  of  the  inverse  metric  is  related  to  the  variation  of  the  metric. 


h  = 


d_ 

de 


ge.by 

£=0 


d_ 

de 


ge 


1  =  ~g  lhg  l. 


e=0 


□ 


Notice  that  the  minus  sign  on  the  right-hand  side  of  the  general  Einstein  equa¬ 
tion  (5.48)  is  usually  absorbed  by  a  sign  change  in  the  definition  of  the  energy- 
momentum  tensor  T. 

The  full  set  of  equations  of  motions  corresponding  to  Jz?  =  =S?eh  + -^matter  is  the 
Einstein  equation  (5.48)  together  with  the  matter  equations  of  motions  coming  from 
varying  Jz?  with  respect  to  the  matter  fields.  Due  to  the  coupling  to  gravity,  the  matter 
equations  of  motions  in  general  also  contain  terms  involving  the  metric,  thereby 
turning  the  full  set  of  equations  of  motions  into  a  system  of  coupled  equations. 

Corollary  5.53.  For  every  solution  of  the  Einstein  equation  (5.48),  the  energy- 
momentum  tensor  (5.46)  of  the  matter  Lagrangian  ^matter  satisfies  the  covariant 
divergence-free  condition  (cf.  Proposition  5.49),  namely 

Ev^v=°> 


(5.49) 
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where  T^v  =  Y,8PP gvaTpo  and  are  the  components  of  the  Levi-Civita  connection 

ofg- 

Proof  This  follows  from  the  equations  of  motion  provided  in  Proposition  5.52  and 
the  fact  that  the  Einstein  tensor  Ricg  —  \scalgg  is  divergence-free,  see  [17,  Ch.  12, 
Lemma  2] .  □ 


Unlike  eq.  (5.42),  eq.  (5.49)  in  general  does  not  correspond  to  a  conservation  law 
in  the  sense  of  Definition  5.9.  The  analog  of  the  quantities  defined  in  eqs.  (5.43)- 
(5.44),  that  is 

JXV\^(g)\T^dav  , 

where  the  integration  is  performed  over  a  spacelike  hypersurface  E,  is  conserved 
only  if  H<3u(y/|det(g)|7’^v)  =  0  holds  rather  than  eq.  (5.49)  (see,  for  example,  [13, 
Ch.  11,  Section  101]  for  further  details).  To  find  a  conservation  law  nevertheless, 
one  needs  to  also  take  into  account  the  contribution  of  the  gravitational  field  (and  its 
derivatives)  to  the  total  energy-momentum.  This  can  be  achieved  by  generalizing  the 
definition  (5.46)  to 


Teff _ 

1  gv  ■ 


2  g.Sf 
Vldet(s)l  8^V 


2 

Vldet(s)l 


VW- 


(5.50) 


An  analysis  along  the  lines  of  [13,  Ch.  11,  Section  101]  shows  the  following  analog 
of  Proposition  5.49  in  the  presence  of  gravity: 

=Y,dn(T^V +tPV)  =  0,  (5.51) 

where  T^v  =  Y.gpp  gvaTpa  is  the  same  as  in  (5.46)  after  raising  the  indices,  and 
t^v  is  called  the  energy-momentum  pseudo-tensor  of  the  gravitational  field.  The 
quantity  f  “  v  is  a  coordinate-dependent  object  that  cannot  be  interpreted  as  the  com¬ 
ponents  of  a  tensor  field.  Note  that  eq.  (5.51)  is  equivalent  to  eq.  (5.49),  with  the 
difference  being  that  the  latter  is  manifestly  covariant.  From  the  above,  we  see  that 
the  distinction  between  energy-momentum  carried  by  the  gravitational  field  versus 
energy-momentum  carried  by  the  matter  fields  becomes  a  subtle  issue  in  general 
relativity.  For  more  details,  the  reader  is  referred  to  [13,  14,  19]. 
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A.l  Exercises  for  Chapter  2 

1 .  Suppose  that  for  every  local  chart  <p  :  U  — >  IR”  on  some  smooth  manifold  M  we 
are  given  a  system  of  n  functions  af  G  C°°(U),  i  =  1, . . .  ,n.  Let  us  denote  by 
Yq  G  C°°(U)  the  components  of  a  smooth  vector  field  Y  on  M  with  respect  to 
the  chart  (p.  Suppose  that  for  every  Y  and  for  every  pair  of  charts  (p  :  U  — >  IR", 
<p  :  U  — >  IR  the  functions  L  (if  Y^  and  £  af  Y~>  coincide  on  U  DU.  Show  that 
there  exists  a  smooth  function  fy  on  M  and  a  smooth  one-form  a  on  M  such  that 
fr\u  =  L and  «| v  =  L af  dx‘  for  every  chart  (p  =  (x1,. . . , x n)  :  U  — >  IR". 
Check  that  a(Y)  =  fy. 

2.  Determine  the  equations  of  motion  of  a  free  particle  in  Euclidean  space  (see 
Example  2.2)  and  find  the  general  solution. 

3.  Determine  the  equations  of  motion  for  the  harmonic  oscillator  (see  Example  2.2) 
and  find  the  general  solution. 

4.  Consider  the  Riemannian  manifold  M  =  (IR"+1  \{0},gCan),  where  gcan  is  the 
restriction  of  the  Euclidean  metric  on  IR"+1.  Denote  by  F  the  cyclic  group 
generated  by  the  homothety  x  i— >  2x. 

a.  Show  that  the  quotient  M  =  M/F  is  diffeomorphic  to  .S'1  x  S"  and  that  the 
Riemannian  metric 

g  ■=  -ygcan,  r(x)  :=  y/{x,x),  x£M, 

induces  a  Riemannian  metric  g  on  M. 

b.  Determine  all  periodic  motions  for  the  Lagrangian  mechanical  system  [M,S£\ 
where  Jz?(v)  =  jg(v,v),  v  G  TM. 

c.  Show  that  (M,g)  is  locally  conformally  flat,  that  is  for  every  p  G  M  there  exists 
an  open  neighborhood  U  C  M  and  a  positive  function  /  G  C°°{U)  such  that  the 
Riemannian  metric  f  ■  g\u  is  flat. 

d.  Does  there  exist  a  global  function  /  G  C°°(M )  such  that  fg  is  of  constant 
(sectional)  curvature? 
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(Hint:  You  may  use  the  classification  of  complete  simply  connected  Riemannian 
manifolds  of  constant  sectional  curvature.  These  manifolds  are  precisely  the 
Euclidean  spaces,  the  spheres  and  the  hyperbolic  spaces.) 

5.  Let  V  be  a  smooth  function  on  a  Riemannian  manifold  (M,g).  Consider  the 
Lagrangian 

L{v)  =  \g(v,v)-V(nv),  v  e  TM,  (A.l) 

where  n  :  TM  —rM  is  the  canonical  projection.  Let  X  be  a  Killing  vector  field  on 
(M,  g)  such  that  X (V)  =  0.  Check  by  direct  calculation  (without  using  Noether’s 
theorem)  that  the  function  v  i— >  g(v,X(nv))  on  TM  is  an  integral  of  motion  of  the 
Lagrangian  mechanical  system 

6.  Show  that  the  automorphism  group  of  the  Lagrangian  mechanical  system  (M,L) 
of  the  previous  exercise  is  given  by 

Aut (M,L)  =  {(p  £  Isom(M)|V o  (p  =  V}. 

Deduce  that  the  infinitesimal  automorphisms  of  (M,L)  are  precisely  the  Killing 
vector  fields  X  such  that  X(V)  =0. 

7.  Let  y :  /  — »  R3  \  {0}  be  a  smooth  curve.  Show  that  the  following  conditions  are 
equivalent: 

a.  7  x  Y  =  0,  where  x  denotes  the  cross  product, 

b.  7  is  a  radial  curve,  that  is  y(7)  is  contained  in  the  ray  R>0v'o  generated  by 
some  constant  vector  vo  €  IR3  \  {0}.  Here  1R>0  denotes  the  set  of  positive  real 
numbers. 

8.  Consider  a  particle  7 : 1  — >  R3  \  {0}  of  mass  m  =  1  moving  in  Euclidean  space 
under  the  influence  of  Newton’s  gravitational  potential  V  =  —  y ,  where  units  have 
been  chosen  such  that  the  gravitational  constant  K  =  1. 

a.  Determine  the  radial  motions  of  the  system. 

b.  Deduce  the  total  fall  time  as  a  function  of  the  initial  radius  if  the  initial  velocity 
is  zero. 

Hints:  Rather  than  calculating  r  as  a  function  of  time  t  you  will  notice  that 
in  general  it  is  easier  to  calculate  t  as  a  function  of  r.  The  function  r  of  t  is 
then  implicitly  detennined  as  the  inverse  function  and  will  not  be  calculated 
explicitly.  The  calculation  ofr  1— ►  t(r)  reduces  to  finding  the  primitive  of  a  function. 
For  that  it  might  be  helpful  to  calculate  the  derivative  of  the  function  F(x)  = 
s/x  +  x2  —  arsinh(^/x)  for  x  >  0. 

9.  (Conservation  of  angular  momentum).  Consider  a  particle  1 1— >•  y(t )  €  IR3  moving 
in  Euclidean  space  according  to  Newton’s  law 


Its  angular  momentum  at  time  t  is  the  vector 
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L(0  :=  j(t)  x  p(f). 
Show  that  L  is  subject  to  the  equation 


where  M  :=  y  x  F  is  the  moment  of  force.  Deduce  that  the  angular  momentum  is 
constant  if  the  moment  of  force  is  zero. 

10.  Let  V  be  a  smooth  function  on  a  pseudo-Riemannian  manifold  (M,  g)  and  consider 
the  Lagrangian  2z?(v)  =  ^ g(v,v )  —V(fiv),  v  £  TM.  Assume  that  with  respect  to 
some  coordinate  system  on  M  the  metric  g  and  the  potential  V  are  both  invariant 
under  rotations  in  one  of  the  coordinate  planes,  say  in  the  (x1  ,x2)-plane.  Show 
that  there  exists  a  corresponding  integral  of  motion  defined  on  the  coordinate 
domain.  How  is  it  related  with  the  notion  of  angular  momentum? 

1 1 .  Show  that  a  function  on  1R3  \  {0}  is  radial  if  and  only  if  it  is  spherically  symmetric, 
that  is  invariant  under  SO(3).  Deduce  that  invariance  under  SO(3)  is  equivalent 
to  invariance  under  0(3). 

12.  Show  that  a  vector  field  on  M  =  IR3  \  {0}  is  radial  if  and  only  if  it  is  spherically 
symmetric,  that  is  invariant  under  the  natural  action  of  SO  (3)  C  Diff(M).  Deduce 
that  invariance  under  SO(3)  is  equivalent  to  invariance  under  0(3). 

Hint:  Recall  that  the  natural  action  of  the  dijfeomorphism  group  Diff(M)  on  the 
vector  space  of  vector  fields  F  :  M  -A  TM  on  a  smooth  manifold  M  is  given  by 

F  1— >■  F V  :=  dtp  0F0  (p~l,  (p  £  Diff(M). 

1 3 .  Let  y :  I  — >  R3  \  {0}  be  the  motion  of  a  particle  in  a  radial  force  field  F  according  to 
Newton’s  law  my"  =F(y).  Recall  that  the  motion  is  planar  due  to  the  conservation 
of  the  angular  momentum  vector.  Show  that  the  area  A(to,t\)  swept  out  by  the 
vector  y during  a  time  interval  [fo,fi]  is  given  by  Afofi)  =  |(f  1  —  to)- 

14.  Let  ( M,g )  be  a  pseudo-Riemannian  manifold  and  /  a  nowhere  vanishing  smooth 
function  on  M.  Consider  the  pseudo-Riemannian  metric 

gN=g  +  fdu2 

on  N  :=M  x  IR,  where  u  is  the  coordinate  on  1R.  Show  that  geodesic  equations  for 
a  curve  t  ha  yN(t)  =  (y(f),«(/))  £  N  =M  x  IR  can  be  separated  into 

a.  the  equations  of  motion  for  y  with  respect  to  a  Lagrangian  of  the  form  Jf(v)  = 
|g(v,  v)  —  V (nv),  v  £  TM,  where  V  is  a  certain  smooth  function  on  M  related 
to  /  and  7t :  T M  — >  M  is  the  canonical  projection  and 

b.  an  ordinary  differential  equation  for  the  function  t  — »  u(t),  which  can  be  solved 
by  integration  once  we  know  t  ha  y(f). 

15.  Determine  the  radius  r  as  a  function  of  the  angle  (p  for  a  motion  of  a  particle  of 
unit  mass  with  non-zero  angular  momentum  in  Coulomb’s  electrostatic  potential. 
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Can  you  use  the  results  about  the  motion  in  Newton’s  gravitational  potential? 
What  is  the  main  difference? 

16.  Let  V  be  a  smooth  function  on  a  pseudo-Riemannian  manifold,  which  is  either 

bounded  from  above  or  from  below.  Show1  that  there  exists  a  function  /  related  to 
V  and  a  pseudo-Riemannian  metric  gw  =  g  +  fdit2  on  N  =  M  x  IR,  where  u  is  the 
coordinate  on  the  IR -factor,  such  that  the  solutions  of  the  Lagrangian  mechanical 
system  defined  by  Jz?(v)  =  ^g(v,v)  —  V(ttv),  v  €  TM,  correspond  precisely  to 
geodesics  t  i-a  Jn( 0  =  (7(0;M(0)  in  with  a  particular  choice  of  the  affine 

parameter  t  and  which  satisfy  u'  f  0. 

17.  Let  V  be  a  smooth  function,  which  is  bounded  from  below,  on  a  complete  Rieman- 
nian  manifold  (M,g).  Show  that  the  solutions  of  the  Lagrangian  mechanical 
system  defined  by  2z?(v)  =  jg(v,v)  —  V(nv),  v  €  TM,  exist  for  all  times. 

Hint:  You  may  use  the  previous  exercise  to  relate  the  problem  to  the  completeness 
of  a  Riemannian  manifold  (N,gw)  of  the  type  N  =  M  x  IR,  g #  =  g  +  fdu2.  Recall 
that  a  Riemannian  metric  on  a  smooth  manifold  M  is  called  complete  if  every 
Cauchy  sequence  in  (M,g)  converges  and  that  this  notion  is  equivalent  to  geodesic 
completeness  by  the  Hopf-Rinow  theorem. 

18.  Deduce  from  the  previous  exercise  that  for  every  smooth  function  V  on  a  compact 
Riemannian  manifold  the  solutions  of  the  Lagrangian  mechanical  system  defined 
by  Jz f(v)  =  jg(v,v)  —  V(?rv),  v  £  TM,  exist  for  all  times. 
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19.  Let  /  be  a  smooth  function  on  a  symplectic  manifold  M.  Compute  the  Hamiltonian 
vector  field  Xf  in  a  coordinate  system  (q 1 , . . . ,  qn ,  p  \ , . . . ,  pn )  defined  on  some  open 
subset  U  CM  such  that  (o\u  =£(//?<■  A dq’ . 

20.  Let  (M,2z?)  be  a  Lagrangian  mechanical  system  and  denote  by  n  :  T M  — >  M  the 
canonical  projection.  Show  that  the  following  conditions  are  equivalent: 


a. 

b. 

c. 

d. 


e. 


££  is  non-degenerate. 

0  :  TM  — >  T*M  is  a  local  diffeomorphism. 

For  all  x  CM,  (<)//■  \  t,m  :  TXM  — >  TfM  is  of  maximal  rank. 

For  all  v  C  TM,  there  exists  a  coordinate  system  (x‘)  defined  on  an  open 

neighborhood  U  of  7 tv  such  that  the  matrix  ( invertible,  where 


(q  . .  .q" .q] , . . . , cj" )  are  the  corresponding  coordinates  on  777. 

For  all  v  £  TM,  and  every  coordinate  system  (xl )  defined  on  an  open  neighbor¬ 


hood  U  of  tzv  the  matrix 


m)«' 


invertible. 


21.  Let  (M,Jf)  be  a  Lagrangian  mechanical  system.  Show  that  the  following  condi¬ 
tions  are  equivalent: 


a.  is  nice. 


1  See  [6]  for  results  generalizing  this  exercise  and  also  Exercise  14. 
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b.  :  TM  — >  T*M  is  a  diffeomorphism. 

c.  ££  is  non-degenerate  and  for  all  x  £  M,  (p^ I txm  '■  TXM  — >  T*M  is  a  bijection. 

d.  For  all  x  £  M,  <py\rxM  ■  TXM  — >•  T*M  is  a  diffeomorphism. 

22.  Let  (M,g)  be  a  pseudo-Riemannian  manifold  and  denote  by  <j>  :  TM  — >  T*M 
the  isomorphism  of  vector  bundles  induced  by  g.  Let  V  be  a  smooth  function 
on  M  and  consider  the  Lagrangian  2 if(v)  =  jg(v,v)  —  V(nv),  v  £  TM.  Here 
n  :  TM  -£  M  denotes  the  projection.  Denote  by  E  £  C°°(TM )  the  energy  and  by 
H  =  E  o  0-1  £  C°°(T*M)  the  Hamiltonian. 

a.  Show  that  if  a  curve  y :  I  — >  T*M  is  a  motion  of  the  Hamiltonian  system 
( T*M ,  0).H).  then  the  curve  K  o  y  :  /  — >  M  i s  a  motion  of  the  Lagrangian 
system  (M,2zf).  Here  ft)  denotes  the  canonical  symplectic  form. 

b.  Show  that  the  map  y  i — >■  0  o  /  from  curves  in  M  to  curves  in  T*M  is  inverse  to 
the  map  y  i— >  n  o  y  from  curves  in  L*M  to  curves  in  M  when  restricted  to  solu¬ 
tions  of  the  Euler-Lagrange  equations  and  Hamilton’s  equations,  respectively. 
Here  n  :  T*M  — >  M  denotes  the  projection. 

23.  State  and  prove  a  local  result  relating  Lagrangian  mechanical  systems  with  non¬ 
degenerate  Lagrangian  to  Hamiltonian  systems,  similar  to  the  global  result  proven 
in  Section  3.2.2  for  nice  Lagrangians. 

24.  Let  /  be  a  smooth  function  on  a  finite-dimensional  real  vector  space  V  such  that 
(j)f  :  V  — ►  V*  is  a  diffeomorphism  and  consider  its  Legendre  transform  /  £  C°°(V*). 
Show  that  (pf  :  V*  — >  V  is  also  a  diffeomorphism  and  that  the  Legendre  transform 
of  / is  /. 

25.  A  subspace  U  C  V  of  a  symplectic  vector  space  (V,  ft))  is  called  isotropic  if 
JJ  C  U Show  that  the  maximal  dimension  of  an  isotropic  subspace  is  n  =  \  dim  V 
and  that  the  isotropic  subspaces  U  of  dimension  n  are  Lagrangian,  that  is  U  =  f/x. 
Deduce  that,  with  the  definitions  given  in  Chapter  4,  an  immersed  submanifold  of  a 
symplectic  manifold  is  Lagrangian  if  and  only  if  its  tangent  spaces  are  Lagrangian. 

26.  Let  ( T*M,H )  be  a  Hamiltonian  system  of  cotangent  type,  n  =  dim M,  and  let 
S:MxU  -£  (R  be  a  smooth  n-parameter  family  of  solutions  of  the  Hamilton- Jacobi 
equation.  Show  that  the  following  conditions  are  equivalent. 

(i)  The  family  is  non-degenerate. 

(ii)  The  map  <P$  :  M  x  U  — >  T*M.  defined  in  Definition  4.7,  is  a  local  diffeomorph¬ 
ism. 

(iii)  For  all  x  £  M, 

<&s\{x}xu  '■  {*}  x  U  =  U  — >■  T*M,  u  i  t  dS"\x , 

is  a  local  diffeomorphism. 

(iv)  For  all  x  £  M, 

^s|{*}x£/ :  W  xU  =  U  -»■  TX*M, 

is  of  maximal  rank. 

(v)  For  all  (x,u)  £  M  x  U  the  n  x  n-matrix 
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is  invertible,  where  (x!)  are  local  coordinates  in  a  neighborhood  of  x  £  M  and 
(«')  are  (for  instance)  standard  coordinates  in  U  C  IR". 

27.  Let  (. M,g )  be  a  Riemannian  manifold  and  let  xq  £  M  be  an  equilibrium  point  of  a 
given  Lagrangian  mechanical  system  _S?(v)  =  ^g(v,  v)  —  V  (jtv).  Further  assume 
that  xo  is  a  local  minimum  and  a  non-degenerate  critical  point  of  V.  The  latter 
means  that  the  Hessian  of  V  with  respect  to  the  Levi-Civita  connection  at  xq 
is  positive  definite.  Show  that  xq  is  a  stable  solution  in  the  sense  of  Lyapunov 
stability,  i.e.  show  that  Ve  >  0  35  >  0,  such  that  for  all  solutions  y :  I  — t  M,  0  £  /, 
of  the  corresponding  Euler-Lagrange  equations  the  following  holds  true: 


^/dM(xo,r(0))2  +gy(0}(y(0),Y(0))  <  5 

=>  \J dM (*o,  7{t))2  +  gy(t)  (7(0-  7(0 )  <  e  Vf  e  /  , 


where  c/m(v)  denotes  the  metric  on  M  induced  by  g. 

Hints:  Use  the  Morse  lemma  to  bring  V  near  xq  to  a  certain  form.  Verify  that 
for  K  C  M  compact  and  contained  in  a  chart  domain,  one  can  find  c  >  0  and 
C  >  0,  such  that  c(-,  ■)  <  ■)  <  C(-,  •),  where  (q,q)  =  Y,q}.  The  topology  of 

M  coincides  with  the  topology  induced  by  dM,  see  [11,  p.  166],  in  particular  V  is 
continuous  with  respect  to  the  metric  dM-  Also,  recall  that  the  energy  is  an  integral 
of  motion,  hence  estimating  E(y{  0))  yields  a  global  result  for  y. 

28.  a.  Consider  a  Lagrangian  mechanical  system  in  IR3  with  radial  potential  Jz?(i')  = 
ij  (v,  v)  —  V (r)  and  assume  that  r(t)  =  r$  is  the  constant  radial  component  of  a 
motion  with  |L|  >0.  Recall  the  corresponding  expression  for  the  energy 

E  =  ^f2  +  yeff(r) 


and  show  that  such  a  solution  is  stable  in  the  sense  of  the  previous  exercise  if 


d2V 

dr2 


(ro)  + 


3  dV 
ro  dr 


(r0)  >  0. 


b.  Consider  V (r)  =  —  a  >  0.  Find  all  n  £  INI  such  that  a  stable  motion  r(t)  =  ro 
exists.  Does  the  answer  depend  on  a?  (Note  that  this  question  is  related  to 
Newtonian  gravitation  in  d  =  n  +  2  dimensions.  To  show  this,  one  needs  to 
use  a  conservation  law  similar  to  the  angular  momentum  which  was  used  in 
dimension  three.) 

c.  For  which  b  >  0  does  the  Lagrangian  system  corresponding  to  the  potential 
V{r)  =  —  “  exp  (—1),  ot  >  0,  have  stable  motions  with  r(t)  =  ro?  Does  the 
answer  depend  on  a? 

d.  For  which  a  does  the  Lagrangian  system  corresponding  to  the  potential  V (r)  = 
oTn(r)  have  stable  motions  with  r(f)  =  ro? 
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29.  Let  <7* , . . .  ,qn,ql , . . .  ,qn  denote  the  standard  coordinates  on  TEn  .  Consider  the 
function  V  :  [R  \  {0}  — >  R  defined  by 

where  Vo,xo  £  R\  {0}  are  constants.  The  Lagrangian  system 

%=  l{q,q)~ 

z  i=  1 

describes  small  oscillations  in  a  linear  chain  of  n  atoms  of  unit  mass.  Note  that 
the  oscillations  are  assumed  to  be  in  direction  of  the  chain. 

a.  Determine  the  distance  a  of  two  neighboring  atoms  in  the  equilibrium  position. 

b.  Show  that  the  Taylor  expansion  up  to  second  order  of  V  at  a  is  given  by 

V(x)  =  ~  +  ^k(x-a)2 

withfe=i|- 

c.  Determine  the  equations  of  motion  corresponding  to  the  linearized  problem 

■^2=  E^,+1  -?')• 

Z  i—  1 
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Consider  a  point  q  on  a  conic  section  CcR2  with  focal  points  f\  ^  fi-  Show  that 
the  tangent  line  at  q  bisects  the  angle  formed  by  the  two  lines  connecting  q  with 
the  focal  points. 

Consider  a  hyperbola  H  C  R2  with  focal  points  f\ ,  /o  at  distance  2c  >  0.  Recall 
that  H  is  defined  by  the  equation  |n  —  ri  \  =  2 a,  where  r i  and  r 2  are  the  distances 
to  f\  and  /2,  and  a  £  (0,  c)  is  a  constant.  Show  that  the  restriction  of  the  Euclidean 
metric  to  H  is  given  by 


E,2  —4  a2 
4(^2 -4c2) 


dE, 


2 


where  E,  =  r\+r2-  (This  result  is  used  in  the  proof  of  Proposition  4.1 1.) 
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32.  Consider  the  Lagrangian  J£(q,q)  =  n?=i  ql  on  IR”  and  let  n  >  2.  Show  that  the 
restriction  of  <j)j?  :  TRn  — >  T*  R"  to  the  open  subset 


Ua  =  {(q,q)  G  LIR”  |  >  0}  C  TR'\ 


where  a  £  {+,—},  is  a  diffeomorphism  onto  its  image.  Check  that  (f>/y(U { )  = 

-  )  if  n  is  odd  and  that  0_$?(t/+)  and  (f)y-  ( U  )  are  invariant  under  the  an¬ 
tipodal  map  in  the  fibers  of  T*Rn  if  n  is  even.  Determine  the  corresponding 
Hamiltonians  on  (j)//'(U0)  and  find  the  general  solution  of  the  corresponding 
Hamilton’s  equations. 

33.  Show  that 

dimJeto(IR", IR)  =  ^ 

34.  Show  that  for  every  smooth  manifold  there  are  canonical  identifications 


a. 

Jet1  (Af ,  IR)  =  IR  x  T*M , 


b. 


Jet1  (IR,Af)  =  IR  x  TM. 


35.  Let  P  =  (Pl ,P")  be  a  smooth  vector-valued  function  on  Jet*(IR",  IRm).  Its  total 
divergence  DivP  is  a  smooth  function  on  Jet1+1(IR”,(Rm).  We  know  from  Proposi¬ 
tion  5.5  that  the  Euler  operators  Ea,  acting  on  functions  on  Jet^1  (IR",  IRm),  vanish 
on  DivP.  Check  this  in  the  following  cases  by  directly  computing  £,„(Div  P): 


a.  k  =  0, 

b.  k  =  m  =  n  =  1. 

36.  a.  Show  that  the  Euler-Lagrange  equations  of  the  free  scalar  field  Lagrangian 
on  the  H-dimensional  Minkowski  space  ( M,g )  =  (IR",  ( dx °)2  —  \(dx1)2), 

given  in  index  notation  by 


&  =  ^(pd^q)-  ^m2(p2, 

yield  the  Klein-Gordon  equation  (□  +  m2)(p  =  0. 
b.  Verify  that  in  the  above  case  □  =  (— l)"”1  -kd kd,  where  *  denotes  the  Hodge 
star-operator,  defined  by 


a  A*/I  =  ( a,P)dvol , 

where  a,/3  are  differential  forms  on  n-dimensional  Minkowski  space  and 
dvol  is  the  metric  volume  form  of  the  Minkowski  metric  (dx0)2  —  £"“/  (dx1)2 
compatible  with  the  standard  orientation. 

37.  Consider  the  Lagrangian  of  the  Kepler  problem  with  Newton’s  potential  in  IR3, 
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J?(v)=l-(v,v)+™. 

Show  that  each  entry  of  the  so-called  Runge-Lenz  vector  (or  Laplace-Runge-Lenz 
vector) 


is  an  integral  of  motion. 

38.  For  a  given  fgN,  determine  the  Euler-Lagrange  equations  for 

2=\  I  Uj,  ^GC“( Jet*(R",R)). 
z  \A<k 

39.  Let  (M,g)  and  (N,h)  be  pseudo-Riemannian  manifolds  of  dimension  m  and 
n,  respectively.  We  assume  that  (N .h)  is  oriented  and  denote  its  volume  form 
by  dvolh ■  Determine  the  Euler-Lagrange  equations  for  the  Lagrangian  n-form 
Sfdvolh  defined  by 


&{jlf)  =  \(df,df),  feCT(N,M), 

where  (•, -)x  is  the  scalar  product  on  T*N  0  TjgxjM,  x  £  N,  induced  by  h  and  g. 
Show  that  the  resulting  equations  are  equivalent  to 

tr/i  Vc//  =  0, 

where  V  is  the  connection  on  the  vector  bundle  T*N  ®  f*TM  — >  N  induced  by 
the  Levi-Civita  connections  on  M  and  N . 

40.  Let  X  and  Y  be  smooth  vector  fields  on  R"  and  Rm,  respectively,  Z  =  X  +  Y, 
dvol  the  standard  volume  form  of  R",  and  ,1J  £  C“(Jet<:(R",  Rm))  a  Lagrangian. 
Assume  that 

(pr^Z)  (Jf)  +  Jzf  divX 

is  a  total  divergence.  Show  that  the  same  calculation  as  in  the  proof  of  Noether’s 
theorem  can  be  used  to  prove  that 

defined  by  Qa  =  Ya  - 

is  a  total  divergence,  where  X'  and  Y“  are  the  components  of  X  £  X(IR")  and 
Y  £  X(Rm),  respectively. 

41.  Let  (M,g)  and  (N.  h)  be  pseudo-Riemannian  manifolds  and  V  £  C°°(N  xM). 
Consider  the  Lagrangian 

^(//)  =  \(df,df)-VU°f),  f  £  C°°(N  ,M). 


A  Exercises 


a.  In  the  special  case  when  (N,h)  is  a  pseudo-Euclidean  vector  space,  show  that 
Jjfdvol/,  is  invariant  under  the  group  of  translations  of  the  source  manifold  N  if 

V  £  C°°(M)  C  C°°(N  x  M)  and  compute  the  corresponding  Noether  currents. 

b.  In  the  special  case  when  (. M,g )  is  a  pseudo-Euclidean  vector  space,  show  that 
Jjfdvoli,  is  invariant  under  the  group  of  translations  of  the  target  manifold  M  if 

V  £  C°°(N)  C  C°°(N  x  M)  and  compute  the  corresponding  Noether  currents. 

42.  Let  FI  C  IR"-1  be  a  bounded  domain  with  smooth  boundary.  We  consider  a 
Lagrangian  Jz?  £  C°°(Jet4(l/,  IR'"))  on  the  cylinder  U  =  R  x  FI  C  R"  with  standard 
coordinates  (x0,*1, . . .  ,x"_1)  =:  (f,x).  Let  P  :  Jetf(R”, IRm)  — »  R"  be  a  smooth 
vector-valued  function  such  that 

Div/*!  jet^-t-i  (£/;iRm) 

is  a  conservation  law  of  Jifdvol,  where  dvol  is  the  standard  volume  form  of  U. 
Let  /  £  C°°(U,  IR'")  be  a  solution  of  the  Euler-Lagrange  equations  which  extends 
smoothly  to  a  neighborhood  of  the  closure  of  U.  We  decompose  the  current 
J  =  P(/f)  =  (7°.J)  :  U  IR"  into  the  charge  density  J°  and  the  flux  density  J. 
Show  that  the  time  evolution  of  the  charge  Q(t)  —  In  J°(t,x)dn  1x  is  given  by 

Q'(t)  =  ~  f  (J ,v)dvoldn, 

J  dQ 

where  v  stands  for  the  outer  normal  of  d£2  and  dvolrlo  for  the  induced  volume 
form  on  d£2  C  IR"  1 .  Conclude  that  the  charge  is  constant  if  the  flux  density  is 
tangent  along  the  boundary  of  Q . 

43.  Consider  Yang-Mills  theory  on  a  Hermitian  line  bundle  over  Minkowski  space 
(. M,g )  =  (IR 4 ,dt2 -~Yla=i(dxa)2).  Since  the  Lie  algebra  of  U(l)  is  one-dimen¬ 
sional,  we  can  identify  the  curvature  F  =  Fy  with  an  ordinary  real-valued  2- 
form.  Show  that  the  Yang-Mills  equation  d*F  =  0  reduces  to  half  of  Maxwell’s 
equations  in  the  vacuum,  that  is 


divE  =  0, 


rotB  = 


whereas  the  Bianchi  identity  d F  =  0  reduces  to  the  other  half  of  Maxwell’s 
vacuum  equations,  that  is 


divB  =  0, 


rotE  =  —  — B. 
at 


where  E  =  Y.Eada  and  B  =  Y.Bada  are  the  time-dependent  vector  fields  on  (R3 
obtained  by  decomposing  F  into 


Ea=F{duda)  and  Ba  = -F(dp,dy), 
where  (a, j3,y)  runs  trough  cyclic  permutations  of  {1,2,3}. 
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44.  Let  V  be  a  G-connection  in  a  vector  bundle  E  and  a  a  one-form  with  values  in 
q(E).  Show  that  the  curvature  of  the  G-connection  V  +  a  is  given  by 

Fv+a  =/rv_|_c/va_|_a/\0;i 


45.  Express  the  Yang-Mills  equations  in  local  coordinates. 

46.  Check  by  direct  calculation  that  the  Noether  current  associated  with  an  infin¬ 
itesimal  gauge  transformation  as  stated  in  Proposition  5.44  is  conserved,  that  is 
divergence-free. 

47.  Prove  that 

d 

dvolge  =  tr(g  h)dvolgl 

£— 0 


de 


for  every  smooth  family  of  pseudo-Riemannian  metrics  gE,  where  g  =  go  and 

ge- 


h=le 


£— 0 
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-connection.  61 
-frame,  61 
-reduction,  6 1 

Action,  6,  45 

Angular  momentum,  12,  17 
Conservation  of,  1 2 

Anti-self-duality  equation,  see  Donaldson 
theory 

Automorphism,  1 1,  50 
Group  of,  65 

Infinitesimal,  11,  51,  54—57,  74 

Bianchi  identity,  62 

Canonical  symplectic  form,  20 
Charge,  see  Electrostatic  theory 
density,  57 

Classical  field  theory,  45 
Configuration  space,  5 
Connection  one-form,  67 
Conservation 
law,  5 1 

of  angular  momentum,  12 
of  energy,  10,  23,  72 
of  momentum,  12,  72 
Conserved  current,  see  Noether  current 
Coordinate 

Darboux,  20,  21 
Induced,  7 
Radial,  12 

Cosmological  constant,  7 1 
Coulomb 

constant,  see  Electrostatic  theory 
potential,  see  Electrostatic  theory 
Covariant  derivative,  9 


Current,  57 
Darboux 

coordinates,  20,  21 
theorem,  see  coordinates 
Derivative 
Covariant,  9 
Total,  47 

Diagonalization,  33 
Donaldson  theory,  66 

Eigen-frequency,  see  Characteristic  frequency 
Eigen-oscillation,  see  Characteristic  oscillation 
Einstein 

equation,  76 
metric,  72 

summation  convention,  7 
tensor,  77 

vacuum  equation,  69 
Einstein-Hilbert  Lagrangian,  69 
Electrostatic  theory,  6 
Energy,  6,  23 

Conservation  of,  10,  23 
density,  74 
Kinetic,  5 

Non-gravitational,  74 
Potential,  5 
Total,  6 

Energy-momentum  pseudo-tensor,  77 
Energy-momentum  tensor,  73,  75,  76 
Re-defined,  75 
Symmetric,  see  Re-defined 
Equations  of  motion,  see  Euler-Lagrange 
equations 

Equilibrium  position,  28 
Neutral,  32 
Stable,  32 
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Unstable,  32 
Euler-Lagrange 

equations,  7,  9,  12,  21,  47,  62,  69 
one-form,  8,  48 
operator,  48 

Field,  see  Classical  field  theory 
Flux  density,  57 
Frequency,  32 
Characteristic,  34 

Resonance,  see  Characteristic  frequency 

Gauge 
group,  65 
theory,  66 
transformations,  65 
infinitesimal,  68,  69 
Geodesic 

as  harmonic  map,  60 
equations,  9 
Gravitational 

coupling  constant,  69 

potential,  see  Newton’s  theory  of  gravity 

Hamilton’s  equation,  21,  37 
Hamilton- Jacobi  equation,  39-41 
Time-dependent,  44 
Hamiltonian,  21 
system,  21 

of  cotangent  type,  37,  40,  41 
vector  field,  20 
Harmonic 
function,  59 
map,  59 
oscillator,  6 

Hooke’s  constant,  see  Harmonic  oscillator 

Induced  coordinates,  7 
Integral  of  motion,  10,  21,  24 

Jet,  46 

Kepler’s  Laws,  17 
Kronecker  delta,  33 

Lagrangian,  5,  46 
immersion,  39 
Linearized,  30 
Matter,  76 

mechanical  system,  5 
Nice,  24 

Non-degenerate,  24 
subspace,  39 
Laplace  operator,  60 


Legendre  transform,  27 
Linearization,  30 
Liouville  form,  20 

Manifold 
Source,  45 
Symplectic,  19 
Target,  45 
Mass,  5 

Matter  Lagrangian,  76 
Maxwell  theory,  62 
Minkowski 
metric,  74 
space,  62 
Momentum,  2 
Angular,  12,  17 
Conservation  of,  12 
density,  74 
Motion,  6,  8,  21 

Integral  of,  10,  21,  24 
Planar,  13 
Planetary,  17 
Multi-index,  46 

Newton 

constant,  see  theory  of  gravity 
theory  of  gravity,  6,  15 
Noether 
charge,  57 
current,  57 

theorem,  11,51,  57,  73 
for  time-dependent  systems,  55,  56 
Normal  mode,  see  Characteristic  oscillation 

Oscillation 

Characteristic,  34 
Small,  32 

Particle 
Free,  6 
Point,  5 
Period,  32 
Phase  space,  21 
Planetary  motion,  17 
Potential,  5 

Coulomb,  see  Electrostatic  theory 
Effective,  14,  15 

Gravitational,  see  Newton’s  theory  of  gravity 
Radial,  13 
Prolongation,  5 1 

Quadratic  approximation,  30 

Radial 
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coordinate,  12 
function,  13 
potential,  13 
vector  field,  13 
Ricci  curvature,  70 


Spherical,  13 
Symplectic 
form,  19 
manifold,  19 
vector  space,  19 


Saddle  point,  32 
Scalar  curvature,  69 

Self-duality  equation,  see  Donaldson  theory 
Sigma  model,  59 
Linear,  60 
Sigma-model 
Linear,  73 
Solution 

of  the  classical  field  theory,  47 
Space-time,  46 
Spatial  translations,  72,  74 
Stability,  32 

Stress-energy  tensor,  see  Energy-momentum 
tensor 

Symmetry,  see  Automorphism 


Tension,  59 
Time,  5 

translations,  56,  72 
Total  derivative,  47 
Total  divergence,  48,  49,  51 

Volume 

element,  see  form 
form,  46,  50,51,57,71 

Yang-Mills 
connection,  66 
equations,  62 
lagrangian,  61,  63,  74 
theory,  61 


